Rick's Cafe AI 1:40 pm on June 19, 2023
Tags: Accuracy ( 35 )

No, GPT4 can’t ace MIT

A paper seemingly demonstrating that GPT-4 could ace the MIT EECS + Math curriculum recently went viral on twitter, getting over 500 retweets in a single day. Like most, we were excited to read the analysis behind such a feat, but what we found left us surprised and disappointed. Even though the authors of the paper said they manually reviewed the published dataset for quality, we found clear signs that a significant portion of the evaluation dataset was contaminated in such a way that let the model cheat like a student who was fed the answers to a test right before taking it.

We think this should call into greater question the recent flurry of academic work using Large Language Models (LLMs) like GPT to shortcut data validation — a foundational principle in any kind of science, and especially machine learning. These papers are often uploaded to Arxiv and widely shared on Twitter before any legitimate peer review. In this case, potentially spreading bad information and setting a poor precedent for future work. — Read More

#accuracy

Rick's Cafe AI 1:32 pm on June 19, 2023
Tags: ChatBots ( 202 ), DevOps ( 277 )

Run open-source LLMs on your computer. Works offline. Zero configuration.

Discover the remarkable capabilities of open-source LLMs on your personal computer. Operate seamlessly without an internet connection and with effortless setup. — Read More

#chatbots, #devops

Rick's Cafe AI 1:25 pm on June 19, 2023
Tags: ChatBots ( 202 )

GPT-4 Can Use Tools Now—That’s a Big Deal

… Earlier this week, OpenAI built tool use right into the GPT API with an update called function calling. It’s a little like a child’s ability to ask their parents to help them with a task that they know they can’t do on their own. Except in this case, instead of parents, GPT can call out to external code, databases, or other APIs when it needs to.

Each function in function calling represents a tool that a GPT model can use when necessary, and GPT gets to decide which ones it wants to use and when. This instantly upgrades GPT capabilities—not because it can now do every task perfectly—but because it now knows how to ask for what it wants and get it. — Read More

#chatbots

M	T	W	T	F	S	S
			1	2	3	4
5	6	7	8	9	10	11
12	13	14	15	16	17	18
19	20	21	22	23	24	25
26	27	28	29	30

Rick's Cafe AI

The latest in Artificial Intelligence carefully curated into its own special blend

Daily Archives: June 19, 2023

No, GPT4 can’t ace MIT

Run open-source LLMs on your computer. Works offline. Zero configuration.

GPT-4 Can Use Tools Now—That’s a Big Deal