Microsoft Releases 1.3B-Parameter Language Model, Outperforms LLaMA
cross-posted from: lemmy.world/post/422633
I wonder if high-quality datasets are the future, as opposed to internet-scraped data that might produce lower-quality output. Either way, neat model!