HyperTech News Report #0003 - Expanding Horizons

Blaed@lemmy.world · 1 year ago

HyperTech News Report #0002 - A New Challenger Approaches!

Blaed@lemmy.world · 1 year ago

HyperTech News Report #0001 - Happy FOSAI Friday!

Blaed@lemmy.world · 1 year ago

Beating GPT-4 on HumanEval with a Fine-Tuned CodeLlama-34B

Blaed@lemmy.world · 1 year ago

Incognito Pilot: The Next-Gen AI Code Interpreter for Sensitive Data

Blaed@lemmy.world · edited 1 year ago

I used to feel the same way until I found some very interesting performance results from 3B and 7B parameter models.

Granted, it wasn’t anything I’d deploy to production, but smaller models are great for prototyping quick ideas before you have to rent a GPU and spend time working with the bigger models.

Give a few models a try! You might be pleasantly surprised. There are plenty to choose from, too. You will get wildly different results depending on your use case and prompting approach.

Let us know if you end up finding one you like! I think it is only a matter of time before we’re casually running 40B+ parameter models at home.