Forget DeepSeek: Researchers develop a $50 OpenAI competitor in less than 30 minutes that thinks harder when you ask it to "wait"

s1 artificial intelligence reasoning model displayed on a smartphone
s1 AI reasoning model. (Image credit: Getty Images | NurPhoto)

The emergence of DeepSeek and its R1 V3-powered AI model, which surpasses OpenAI's o1 reasoning model across a wide range of benchmarks, including math, science, and coding, has raised investor concern about the exorbitant cost tied behind AI advances, seemingly making commitments such as OpenAI's $500 billion Stargate project seem counter-productive.

Researchers at Stanford and the University of Washington recently developed an AI model to take on OpenAI's o1 reasoning model. For more context, the model, dubbed s1, was trained using a dataset of 1,000 questions for under $50 (via TechCrunch). The researchers managed to achieve this milestone by distilling information from proprietary larger AI models.

Distillation is the process where a small AI model extracts information from larger AI models. In this case, the researchers indicated that s1 extracted its answers from Google's Gemini 2.0 Flash Thinking Experimental AI reasoning model. As spotted by The Verge, the tool's terms of service categorically indicate that it's prohibited to use Gemini's API to develop models that compete with the company's AI models.

The process narrows the gap between AI startups and well-established AI firms, as they can develop sophisticated entries without breaking the back. However, top AI labs, including OpenAI and Microsoft, by extension, aren't happy about smaller AI startups using distillation to refine their AI models. OpenAI and Microsoft recently accused DeepSeek of using their copyrighted data to train its ultra-cost-effective model.

s1's training process took less than 30 minutes using 16 NVIDIA H100 GPUs. The model is based on Qwen2.5, an open-source Alibaba AI model. More interestingly, the researchers revealed that they asked the AI model to "wait" during the reasoning process, prompting it to think harder before generating its response to the query. “This can lead the model to doublecheck its answer, often fixing incorrect reasoning steps,” the researchers noted. As a result, the AI model seemingly generated well-curated and accurate answers.

You can check out the s1 model on GitHub.

CATEGORIES
Kevin Okemwa
Contributor

Kevin Okemwa is a seasoned tech journalist based in Nairobi, Kenya with lots of experience covering the latest trends and developments in the industry at Windows Central. With a passion for innovation and a keen eye for detail, he has written for leading publications such as OnMSFT, MakeUseOf, and Windows Report, providing insightful analysis and breaking news on everything revolving around the Microsoft ecosystem. You'll also catch him occasionally contributing at iMore about Apple and AI. While AFK and not busy following the ever-emerging trends in tech, you can find him exploring the world or listening to music.

Read more
Deepseek and ChatGPT logos
DeepSeek outperforms OpenAI's reasoning model at just 3% of the cost after President Trump's $500 billion Stargate AI initiative. “All I know is we keep pushing forward to make open-source AGI a reality for everyone🚀”
Artificial intelligence mobile app icons for DeepSeek, ChatGPT and Google Gemini arranged on a smartphone.
Meta AI lead scientist claims "open-source" is the secret ingredient to DeepSeek's triumph over proprietary models at a fraction of the cost — dethroning OpenAI's ChatGPT as the most downloaded free app in the US
Sam Altman in a courtroom setting
"I think it is pretty hopeless": DeepSeek proves OpenAI's previous dismissal of AI startups with only $10M funding wrong, reveling in success at a fraction of the budget
DeepSeek logo is seen on a mobile screen.
DeepSeek's $6 million R1 cost-efficient model training might be a ruse — the Chinese startup reportedly spent $1.6 billion and bought 50,000 NVIDIA GPUs while its top researchers earned $1.3 million
The X account of OpenAI CEO Sam Altman is displayed on a mobile phone with a ChatGPT logo.
Sam Altman says DeepSeek's R1 cost-effective AI is impressive, but OpenAI will "obviously deliver much better models" than it — "Look forward to bringing you all AGI and beyond."
The logos of OpenAI and DeepSeek artificial intelligence apps on mobile phones.
Is DeepSeek's AI a brand-new secondhand ChatGPT? A "unanimous jury" rules its AI-generated text matches OpenAI models by 74%
Latest in Software Apps
Photo of Microsoft's new sign-in page for Xbox.com using the Microsoft Edge browser.
Over one billion users will get a new Microsoft user experience, and it has a dark mode
Artificial intelligence mobile apps for DeepSeek, ChatGPT and Google Gemini arranged.
Google says its latest reasoning model is its "most intelligent" — but Microsoft's CEO claims Google already fumbled its AI opportunity
ChatGPT and Microsoft Logo
ChatGPT’s new image-generation tool is impressive; it can finally create a glass of wine filled to the brim — but it struggles with blank white images and appears to discriminate against 'sexy women'
Microsoft Edge Sidebar
My favorite Microsoft Edge feature just got an AI upgrade — is this the best way to use Copilot on Windows 11?
Professor Sir Roger Penrose, physicist, mathematician and cosmologist
Nobel laureate claims "AI will not be conscious" and shouldn't be considered intelligent — Until it develops its own ideas
In this photo illustration OpenAI ChatGPT icon is displayed on a mobile phone screen in Ankara, Turkiye on August 13, 2024.
OpenAI says an excessive dependency on ChatGPT can lead to loneliness and a "loss of confidence" in decision-making
Latest in News
Cloud servers
Microsoft has killed "several" data center projects in the U.S. and Europe, according to reports
Photo of Microsoft's new sign-in page for Xbox.com using the Microsoft Edge browser.
Over one billion users will get a new Microsoft user experience, and it has a dark mode
The Thing: Remastered key art
The Thing comes to Xbox Cloud Gaming's "Stream Your Own Game" library alongside other new arrivals
Promotional screenshot of heroes fighting a giant in Pillars of Eternity
Obsidian's classic Baldur's Gate successor 'Pillars of Eternity' is getting a surprise turn-based mode later this year, alongside other updates
Atomfall
Atomfall reviews and Metacritic scores are in: Here's a roundup of what everyone's saying about this new Game Pass survival game
Screenshot of one of the new flat world presets in Minecraft.
Minecraft testing new flat world presets and a better way to locate your friends in-game