Introducing the chemical-specific AI inference model 'ether0' capable of scientifically correct molecular design, applying reinforcement learning to existing language models to achieve performance that exceeds that of humans and existing AI



FutureHouse , a nonprofit organization developing AI agents for science and technology, has announced the launch of ether0 , an AI inference model specialized for chemical tasks. ether0 is a reinforcement learning model based on Mistral-Small-24B-Instruct-2501 (Mistral Small 3) , and can provide highly accurate responses to questions about molecules.

ether0: a scientific reasoning model for chemistry | FutureHouse
https://www.futurehouse.org/research-announcements/ether0-a-scientific-reasoning-model-for-chemistry

TRAINING A SCIENTIFIC REASONING MODEL FOR CHEMISTRY
(PDF file) https://storage.googleapis.com/aviary-public/ether0_preprint.pdf

Inference models such as OpenAI o3 and Claude Opus 4 can score higher than humans in benchmark tests that include questions about chemistry. However, they are not good at answering practical questions about molecules, and sometimes respond with scientifically impossible molecular structures. Ether0 was developed to solve this problem.

ether0 is a model that uses reinforcement learning and fine-tuning based on Mistral AI's inference model 'Mistral-Small-24B-Instruct-2501', and is capable of processing molecular questions with high accuracy. Below are the results of outputting the structure of 'C 27 H 37 N 3 O 4 ' to ether0 (left), OpenAI o3 (center), and Claude Opus 4 (right). OpenAI o3 and Claude Opus 4 output incorrect structures, but ether0 was able to output the scientifically correct structure.



The graph below shows the accuracy rate when multiple AI models, including ether0, and humans were asked to solve chemistry problems. The left side of the graph shows the accuracy rate for free-form questions, and the right side shows the accuracy rate for multiple-choice questions. As you can see from the graph, ether0 outperforms humans and other AI models in free-form questions.



In addition, ether0 could output the thoughts leading up to a direct answer to a question, and the content was scientifically convincing.




Sam Rodriques , CEO of FutureHouse, points out that ether0 is notable for its ability to learn more efficiently than specialized models. The graph below shows the progress of learning on the horizontal axis and the accuracy rate on the vertical axis, and it can be seen that ether0 has high accuracy from an early stage.



However, ether0 is merely a prototype model, and there are problems such as 'performance declines in problems other than molecular problems' and 'due to the influence of training data, it outputs incorrect answers in some tasks.' Nevertheless, Rodriques appeals, 'This research can be said to be a proof of concept that shows that with the right training data, language models can achieve superhuman performance in scientific problems very efficiently.'




The model data for ether0 is distributed at the following link.

futurehouse/ether0 · Hugging Face
https://huggingface.co/futurehouse/ether0



You can also try running ether0 on the following page:

ether0 Chemical Reasoner
https://ether0.platform.futurehouse.org/



in Software,   Science,   , Posted by log1o_hf