Rick's Cafe AI 8:38 pm on November 7, 2022
Tags: NLP ( 486 )

SetFit outperforms GPT-3 while being 1600x smaller

Everyone is very familiar with the current hype around Large Language Models (LLM) such as GPT-3 and Image Generation models such as DALL-E 2 and Stable diffusion. However, the results of these models come at a price.

GPT-3: ~$12 Million
DALL-E: ~$500k- $1 million
Stable Diffusion: ~$600k

This is due to the large number of GPU’s required to process and train these models. Besides the cost, the amount of labelled data required to achieve these results is difficult to source. Previously it was not possible for startups to train models of this calibre because of these two factors until now— introducing Sentence Transformer Fine-tuning (SetFit) a simple and efficient alternative for few-shot text classification unveiled by the teams at Intel Labs, UKP Labs and Hugging Face. Read More

#nlp

Recent Activity

s: search
c: compose new post
r: reply
e: edit
t: go to top
j: go to the next post or comment
k: go to the previous post or comment
o: toggle comment visibility
esc: cancel edit post or comment

Design a site like this with WordPress.com

Get started

M	T	W	T	F	S	S
	1	2	3	4	5	6
7	8	9	10	11	12	13
14	15	16	17	18	19	20
21	22	23	24	25	26	27
28	29	30

Rick's Cafe AI

The latest in Artificial Intelligence carefully curated into its own special blend

Daily Archives: November 7, 2022

SetFit outperforms GPT-3 while being 1600x smaller