LG AI Research has unveiled EXAONE Deep, a reasoning model that excels at complex problem-solving across maths, science, and coding.
The company highlighted the global challenge of creating advanced reasoning models, noting that currently only a handful of organisations with foundation models are actively pursuing this complex area. EXAONE Deep aims to compete directly with these leading models, showcasing a competitive level of reasoning ability.
LG AI Research has focused its efforts on dramatically improving EXAONE Deep’s reasoning capabilities in core domains. The model also demonstrates a strong ability to understand and apply knowledge across a broader range of subjects.
The benchmark results released by LG AI Research cover the following areas:
- Maths: The EXAONE Deep 32B model outperformed a competing model on a demanding mathematics benchmark, despite being only 5% of its size. Furthermore, the 7.8B and 2.4B versions achieved first place in all major mathematics benchmarks for their respective model sizes.
- Science and coding: In these areas, the EXAONE Deep models (7.8B and 2.4B) secured the top spot across all major benchmarks.
- MMLU (Massive Multitask Language Understanding): The 32B model achieved a score of 83.0 on the MMLU benchmark, which LG AI Research claims is the best performance among domestic Korean models.
The capabilities of the EXAONE Deep 32B model have already garnered international recognition.
Shortly after its release, it was included in the ‘Notable AI Models’ list by US-based non-profit research organisation Epoch AI. This listing places EXAONE Deep alongside its predecessor, EXAONE 3.5, making LG the only Korean entity with models featured on this prestigious list in the past two years.

Maths prowess
EXAONE Deep has demonstrated exceptional mathematical reasoning skills across its model sizes (32B, 7.8B, and 2.4B). In assessments based on the 2025 academic year’s mathematics curriculum, all three models outperformed global reasoning models of comparable size.
The 32B model achieved a score of 94.5 in a general mathematics competency test and 90.0 in the American Invitational Mathematics Examination (AIME) 2024, a qualifying exam for the US Mathematical Olympiad.
In AIME 2025, the 32B model matched the performance of DeepSeek-R1, a significantly larger 671B model. This result showcases EXAONE Deep’s efficient learning and strong logical reasoning abilities, particularly when tackling challenging mathematical problems.
The smaller 7.8B and 2.4B models also achieved top rankings in major benchmarks for lightweight and on-device models, respectively. The 7.8B model scored 94.8 on the MATH-500 benchmark and 59.6 on AIME 2025, while the 2.4B model achieved scores of 92.3 and 47.9 in the same evaluations.
Science and coding excellence
EXAONE Deep has also showcased remarkable capabilities in professional science reasoning and software coding.
The 32B model scored 66.1 on the GPQA Diamond test, which assesses problem-solving skills in PhD-level physics, chemistry, and biology. In the LiveCodeBench evaluation, which measures coding proficiency, the model achieved a score of 59.5, indicating its potential for high-level applications in these expert domains.
The 7.8B and 2.4B models continued this trend of strong performance, both securing first place in the GPQA Diamond and LiveCodeBench benchmarks within their respective size categories. This achievement builds on the success of the EXAONE 3.5 2.4B model, which previously topped Hugging Face’s LLM Leaderboard in the edge division.
Enhanced general knowledge
Beyond its specialised reasoning capabilities, EXAONE Deep has also demonstrated improved performance in general knowledge understanding.
The 32B model achieved a score of 83.0 on the MMLU benchmark, positioning it as the top-performing domestic model in this comprehensive evaluation. This indicates that EXAONE Deep’s reasoning enhancements extend beyond specific domains and contribute to a broader understanding of various subjects.
LG AI Research believes that EXAONE Deep’s reasoning advancements represent a leap towards a future where AI can tackle increasingly complex problems and contribute to enriching and simplifying human lives through continuous research and innovation.
The post LG EXAONE Deep is a maths, science, and coding buff appeared first on AI News.
Posted by: Dr.Durant. When reposting, please credit the source: https://robotalks.cn/lg-exaone-deep-is-a-maths-science-and-coding-buff/