mistral.ai

swordsmanluke, to programming in Mistral 7B AI Model Released Under Apache 2.0 License

This looks really interesting!

Some recent studies have shown that, for the performance they demonstrate, most models are nowhere near as compact as they could be. That means we should expect an explosion in the capability of small models like this one as new techniques find ways to improve our models.

Unfortunately, I couldn’t find a recommendation for how much VRAM you need to run this model, though it does call out being able to run it locally, which is awesome!

I’ll try it out after work and see if it can run on an old 8GB 2070. 😄

e0qdk,
@e0qdk@kbin.social avatar

It's not clear to me either exactly what hardware the reference implementation requires, but there's a bunch of discussion in the HN thread about getting it to work with llama.cpp, so it might be possible soon (or maybe already is?) to run it on the CPU if you're willing to wait longer for it to process. Something like the sketch below should work once that lands.
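In case it helps, here's a minimal sketch of what CPU inference could look like through the llama-cpp-python bindings, assuming llama.cpp support is in place and the weights have been converted to a GGUF file (the filename below is a placeholder):

```python
# Minimal CPU-inference sketch via the llama-cpp-python bindings.
# Assumes llama.cpp supports the model and the weights have already been
# converted to a GGUF file; the path below is a placeholder.
from llama_cpp import Llama

llm = Llama(
    model_path="./mistral-7b.gguf",  # placeholder path to converted weights
    n_ctx=2048,                      # context window
    n_threads=8,                     # CPU threads to spend on inference
)

out = llm("Q: What is the capital of France? A:", max_tokens=32)
print(out["choices"][0]["text"])
```

Expect it to be much slower than GPU inference, but it sidesteps the VRAM question entirely.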

Let us know how it goes!

TheChurn,

how much VRAM you need to run this model

It will depend on the representation of the parameters. Most models support bfloat16, where each parameter is 16 bits (2 bytes). For these models, every billion parameters needs roughly 2 GB of VRAM.
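Back-of-the-envelope, that works out like this (weights only, ignoring activations and other overhead):

```python
# Rough weight-memory estimate: parameter count * bytes per parameter.
# Ignores activations, KV cache, and framework overhead.
params = 7e9         # Mistral 7B
bytes_per_param = 2  # bfloat16 / float16

print(f"~{params * bytes_per_param / 1e9:.0f} GB of VRAM for the weights")  # ~14 GB
```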

It is possible to reduce the memory footprint by using 8 bits for each param, and some models support this, but they start to get very stupid.
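For what it's worth, this is roughly what 8-bit loading looks like via the Hugging Face transformers + bitsandbytes route (just a sketch; I haven't checked how well this particular model holds up at 8-bit):

```python
# Sketch: load the weights in int8 via bitsandbytes to roughly halve the
# footprint (~7 GB for 7B parameters), at some cost in output quality.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "mistralai/Mistral-7B-v0.1"  # Hugging Face repo id

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    load_in_8bit=True,   # requires the bitsandbytes package
    device_map="auto",   # place layers on GPU/CPU automatically
)

inputs = tokenizer("The capital of France is", return_tensors="pt").to(model.device)
print(tokenizer.decode(model.generate(**inputs, max_new_tokens=10)[0]))
```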

Sigmatics,

That would mean roughly 14 GB (7B parameters × 2 bytes) to run this one at 16-bit, so in practice a 16 GB card.
