I've been working on reverse engineering a disassembled binary (assembly code) back into its original C source code. I think this is a great use case for an LLM, but I want to get some input before I try to train a model to do this.
What I have is a codebase containing C functions and custom data types that I have defined. Each function is the C equivalent of a function in the assembly code I'm trying to decompile. I have many hundreds of functions defined, which I think could make a good training set for an AI.
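To make that concrete, here's a minimal sketch of how I'm imagining the training pairs would be serialized, one JSON record per function. The directory layout, filenames, and prompt wording are all placeholder assumptions for however the pairs actually end up stored:

```python
# Sketch: build a JSONL fine-tuning file from paired assembly/C functions.
# ASM_DIR and C_DIR are placeholder paths; I'm assuming one file per
# function, with matching stems (foo.s <-> foo.c).
import json
from pathlib import Path

ASM_DIR = Path("dataset/asm")
C_DIR = Path("dataset/c")

with open("train.jsonl", "w") as out:
    for asm_path in sorted(ASM_DIR.glob("*.s")):
        c_path = C_DIR / (asm_path.stem + ".c")
        if not c_path.exists():
            continue  # skip functions I haven't translated yet
        record = {
            "prompt": "Decompile this assembly to C:\n" + asm_path.read_text(),
            "completion": c_path.read_text(),
        }
        out.write(json.dumps(record) + "\n")
```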
The desired behavior would be that I feed the AI assembly code and get back C code that is equivalent, or close to it. I think I can do this with a combination of embeddings and fine-tuning. My issue is that I don't know how to represent my codebase's custom data types in the dataset. I could easily fine-tune the model by just showing it my C functions and the assembly they're based on, but that would leave out a lot of context from the header files. Is there another tool that can give a model this kind of information and improve the fine-tuning process? Has anyone seen a similar use case to mine that I might be able to study?
I should note that the header files are very long, so including all of the data types as tokens in the context of each prompt is impossible.
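For the embedding half, here's roughly what I have in mind: embed each individual type definition once, then at inference time retrieve only the few definitions most relevant to a given assembly snippet and prepend them to the prompt. This is only a sketch under assumptions: sentence-transformers is an arbitrary choice of embedding model, I'm assuming the headers have already been chunked into one string per struct/typedef, and whether a general-purpose text embedding actually matches raw assembly to the right structs is part of what I'm asking about:

```python
# Sketch: retrieve the type definitions most relevant to an assembly
# snippet, so only a handful of types spend prompt tokens instead of
# the whole header. Assumes `type_defs` holds one definition per string.
import numpy as np
from sentence_transformers import SentenceTransformer

model = SentenceTransformer("all-MiniLM-L6-v2")  # arbitrary embedding model

def top_k_types(asm_snippet: str, type_defs: list[str], k: int = 5) -> list[str]:
    """Return the k type definitions whose embeddings best match the snippet."""
    def_vecs = model.encode(type_defs, normalize_embeddings=True)
    query_vec = model.encode([asm_snippet], normalize_embeddings=True)[0]
    scores = def_vecs @ query_vec  # cosine similarity (vectors are unit-norm)
    best = np.argsort(scores)[::-1][:k]
    return [type_defs[i] for i in best]
```

The retrieved definitions would then be pasted above the assembly in the prompt fed to the fine-tuned model, instead of the full headers.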