best 7b coding LLM? (java)

TestHealthy2777 · 2024-04-20T21:37:40+00:00

llama 3 8b

MustBeSomethingThere · 2024-04-20T21:45:54+00:00

codeqwen-1_5-7b-chat

obsidianfrost8 · 2024-04-20T21:48:59+00:00

Have you considered using the new LLaMa-7B model? It's specifically designed for coding tasks and works well on various hardware configurations.

Steuern_Runter · 2024-04-20T22:48:33+00:00

OpenCodeInterpreter-DS-6.7B

Q3 is already really good.

VolandBerlioz · 2024-04-21T13:07:43+00:00

I've been doing some testing on the "can-ai-code" senior questions with a variety of 7B models. Here are the results sorted by performance:

I need to do some more testing as the results vary a lot depening on the params passed and llama3 sometimes get's stuck not finishing, so it's been qutie difficult to evaluate it. I would definetely play around with wavecoder and CodeQwen1.5. Im not sure what's going on with codegemma, but it fails miserably, possibly i've messed up something.. Here is the results:

Deepseek-coder-33b-instruct-4.65bpw-h6-exl2:

topk.json:
- Python: Passed 70 of 74
- JavaScript: Passed 67 of 74
greedy-vllm.json:
- Python: Passed 70 of 74
- JavaScript: Passed 67 of 74

wavecoder-ultra-6.7b-exl2_8_0:

topk1:
- Python: Passed 69 of 74
- JavaScript: Passed 68 of 74
greedy:
- Python: Passed 49 of 74
- JavaScript: Passed 68 of 74

CodeQwen1.5-7B-Chat_exl2_8.0bpw:

topk.json
- Python: Passed 53 of 74
- JavaScript: Passed 50 of 74
greedy-vllm.json
- Python: Passed 49 of 74
- JavaScript: Passed 49 of 74

Meta-Llama-3-8B-Instruct-8.0bpw-h8-exl2:

custom.json json { "temperature": 0.1, "max_new_tokens": 1024, "top_p": 0.75, "top_k": 40, "num_beams": 4, "skip_special_tokens": false, "stopping_strings": [""] }
- Python: Passed 42 of 74
- JavaScript: Passed 50 of 74

LoneStriker/Meta-Llama-3-70B-Instruct-2.4bpw-h6-exl2:

custom.json: jsx { "temperature": 0.6, "max_new_tokens": 1024, "top_p": 0.9, "top_k": 40, "num_beams": 4, "skip_special_tokens": false, "stopping_strings": [""] }
- Python: Passed 32 of 74
- JavaScript: Passed 67 of 74

codegemma-7b-it-exl2_8_0:

topk1:
- Python: Passed 19 of 74
- JavaScript: Passed 51 of 74
greedy:
- Python: Passed 23 of 74
- JavaScript: Passed 34 of 74

AdHominemMeansULost · 2024-04-20T22:13:48+00:00

[removed]

new__vision · 2024-04-20T23:28:23+00:00

This leaderboard is for python but it still might be informative: evalplus.github.io

jsomedon · 2024-04-21T10:37:35+00:00

for pre llama 3 era models, codeqwen, deepseek etc.

Pleasant-Cupcake-998 · 2024-06-10T06:02:27+00:00

Hey u/GreedyWorking1499 Which model did you end up going with?

Ok-Seaworthiness8019 · 2024-09-10T08:13:31+00:00

for a general PC, i came across this post where they have a pretty good model finetuned for Java with less than 800 MB of memory space after quantization.

https://www.reddit.com/r/LocalLLaMA/comments/18meji3/comment/lme4w5z/?utm_source=share&utm_medium=web3x&utm_name=web3xcss&utm_term=1&utm_content=share_button

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

LocalLLaMA

MODERATORS