Artificial intelligence
Upstage's AI beats global peers in benchmark math tests
Its MathGPT outperformed Microsoft's ToRA math-specific large language model in two global tests
By Jan 08, 2024 (Gmt+09:00)
1
Min read
Most Read
LG Chem to sell water filter business to Glenwood PE for $692 million


Kyobo Life poised to buy Japan’s SBI Group-owned savings bank


KT&G eyes overseas M&A after rejecting activist fund's offer


StockX in merger talks with Naver’s online reseller Kream


Mirae Asset to be named Korea Post’s core real estate fund operator



South Korean artificial intelligence tech startup Upstage said on Monday that its math-specific large language model (LLM), jointly developed with local startup Masspresso and telecom leader KT Corp., has outperformed Microsoft Corp.’s ToRA in two global math benchmark tests.
Upstage’s MathGPT achieved 0.488 out of a full score of 1 in the latest MATH benchmark test for LLMs having 13 billion parameters or less. The test is based on a dataset of 12,500 challenging math problems.
The Korean model outperformed OpenAI's LLM GPT-4, which scored 0.425, chatbot ChatGPT's 0.355 and ToRA's 0.481, Upstage said.
In the GSM8K benchmark, or Grade School Math 8K, MathGPT topped the LLM list. The Korean AI scored 0.782, beating ToRA’s 0.758. The benchmark is based on a dataset of 8,500 high quality, linguistically diverse grade school math word problems.
Math has been a difficult field in which to apply LLMs due to the need for logical reasoning and abstract thinking.
Upstage has been developing the math-specific LLM with Masspresso, the operator of the AI-backed learning platform Qanda, since last year. This is part of the two AI startups’ partnership with KT, which last September invested 10 billion won ($7.6 million) in each of the tech ventures to strengthen its hyperscale AI capabilities.
Masspresso, which collects around 10 million data on math problems and explanations per day, has provided Upstage with the dataset.
KT operates Korea’s largest graphics processing unit (GPU) farm, a set of servers that allocate resources to quickly perform calculations, to accelerate the two startups’ math-specific LLM development.
Upstage will lead the innovation of generative AI in math and other domains with its global top LLM tech, said Chief Executive Kim Seong-hoon.
AI in the global edtech industry, which has been at the level of Google search, will be upgraded with MathGPT, said a Quanda official.
Write to Kang-Ho Jang at autonomy@hankyung.com
Jihyun Kim edited this article.
More to Read
-
Artificial intelligenceUpstage to develop math-specific LLM with Qanda
Nov 15, 2023 (Gmt+09:00)
2 Min read -
Artificial intelligenceUpstage to develop shopping-specific AI with ConnectWave
Sep 12, 2023 (Gmt+09:00)
1 Min read -
Artificial intelligenceS.Korea's KT invests in domestic startups Upstage, Qanda
Sep 11, 2023 (Gmt+09:00)
1 Min read -
Artificial intelligenceS.Korean LLM by Upstage beats global benchmark ChatGPT
Aug 01, 2023 (Gmt+09:00)
3 Min read -
Artificial intelligenceUpstage’s AskUp offers Korea’s first GPT-4-powered chatbot service
Mar 17, 2023 (Gmt+09:00)
2 Min read
Comment 0
LOG IN