Artificial intelligence writes poems but struggles with math. Why can't ChatGPT and other chatbots handle even basic arithmetic? We look at the causes of AI's mathematical mistakes, from tokenization, which breaks numbers into unintelligible fragments, to a statistical learning approach that falls short in mathematics.
Artificial intelligence, including ChatGPT, can write poems, compose music, and translate texts. Yet it often stumbles on simple mathematical tasks. Why can't a chatbot that handles complex language tasks deal with math at an elementary school level?
One of the key problems is tokenization. This process divides text into smaller parts, called tokens. Imagine cutting a text into puzzle pieces, much as words are broken down into syllables. The tokenizer, the component responsible for this process, has no notion of what numbers mean.
For example, the number 380 may be treated as a single token, while 381 is split into two (38 and 1). Splits like this obscure the relationships between digits and make calculation harder.
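The split described above can be reproduced with a toy example. The sketch below is a minimal greedy longest-match tokenizer over a made-up vocabulary. The names `VOCAB` and `tokenize` are illustrative assumptions, and this is not ChatGPT's actual tokenizer, whose vocabulary is learned from data.

```python
# Hypothetical toy vocabulary: "380" happens to be an entry, "381" does not.
VOCAB = {"380", "38", "0", "1", "2", "3", "4", "5", "6", "7", "8", "9"}

def tokenize(text: str) -> list:
    """Split `text` into vocabulary entries, always taking the longest match."""
    tokens = []
    i = 0
    while i < len(text):
        # Try the longest remaining candidate first, then shorter ones.
        for j in range(len(text), i, -1):
            if text[i:j] in VOCAB:
                tokens.append(text[i:j])
                i = j
                break
        else:
            raise ValueError(f"no token matches at position {i}")
    return tokens

print(tokenize("380"))  # ['380']
print(tokenize("381"))  # ['38', '1']
```

Two neighbouring numbers end up with different shapes: one is a single unit, the other a pair of fragments, and nothing in the tokens themselves says that they differ by exactly one.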
Another reason for ChatGPT's mathematical difficulties is its statistical nature. The chatbot learns from a vast number of examples and looks for patterns in them. For instance, it learns that the phrase "Dear Sir" is often followed by the phrase "we are reaching out to you".
However, this approach runs into trouble in mathematics. ChatGPT can guess that the product of two numbers ending in 2 will end in 4, but it struggles to keep track of the intermediate results. Simply put, the model tries to guess the result from learned patterns instead of performing a precise calculation.
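The gap between a surface pattern and a real calculation is easy to show. The sketch below is only an illustration, not a description of what happens inside the model: the hypothetical helper `last_digit_pattern` uses nothing but the final digits of the two factors, which is enough to get the last digit of the product right and tells us nothing about the rest.

```python
def last_digit_pattern(a: int, b: int) -> int:
    """Predict only the final digit of a * b from the final digits of a and b."""
    return (a % 10) * (b % 10) % 10

# Illustrative pairs of numbers that both end in 2.
pairs = [(12, 22), (32, 42), (102, 512)]
for a, b in pairs:
    exact = a * b  # the full product needs every digit and every carry
    print(a, "x", b, "ends in", last_digit_pattern(a, b), "| exact product:", exact)
```

The pattern holds in every case, yet the remaining digits depend on carries between positions, which are exactly the intermediate results that a pattern-based guess skips.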
A study conducted by Yuntian Deng from the University of Waterloo showed that ChatGPT struggles with multiplying numbers that have more than four digits. The reason is that any error in a single calculation step carries through to the final result.
Imagine it as a domino effect: one error triggers a chain reaction, and the result is completely off. However, there is hope that ChatGPT will improve in the future. Deng and his colleagues also tested OpenAI's o1 model, which is notable for its logical reasoning capabilities.
This model achieved significantly better results than the standard GPT-4o and was able to correctly solve multiplications of nine-digit numbers. The o1 model thinks through the problem step by step, allowing for more accurate results.
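Both the domino effect and the value of working step by step can be sketched with ordinary schoolbook multiplication. The code below is an analogy under stated assumptions, not a description of how o1 works internally: the function `multiply_step_by_step` and its `faulty_step` parameter are hypothetical, and the injected off-by-one stands in for a single slip in an intermediate result.

```python
from typing import List, Optional, Tuple

def multiply_step_by_step(
    a: int, b: int, faulty_step: Optional[int] = None
) -> Tuple[int, List[str]]:
    """Multiply non-negative integers digit by digit and log every step.

    If `faulty_step` is given, that partial product is deliberately off by one.
    """
    total = 0
    log = []
    for step, digit in enumerate(reversed(str(b))):
        partial = a * int(digit)
        if step == faulty_step:
            partial += 1  # injected error, for illustration only
        total += partial * 10 ** step
        log.append(f"step {step}: {a} x {digit} x 10^{step} -> running total {total}")
    return total, log

a, b = 123456789, 987654321  # two nine-digit numbers
exact, steps = multiply_step_by_step(a, b)
wrong, _ = multiply_step_by_step(a, b, faulty_step=3)

print(exact == a * b)  # True when every step is correct
print(wrong - exact)   # the slip at step 3 survives into the final answer
for line in steps[:3]:
    print(line)
```

A single wrong partial product is never corrected later, so the final answer is wrong. Writing each intermediate result down, as in the log, keeps every step small and makes it possible to check them one at a time.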