Google Gemini Math surpasses o1 preview version! Cost is only 1/10, no extra thinking time is required, the old paradigm is not dead yet
Xiaojiao is from Aofei Temple
Quantum Bit | Public Account QbitAI
Math beats o1-preview at one-tenth the cost and with virtually no thinking delay!
On the same day that OpenAI's "Her" was fully released, Google Gemini 1.5 underwent a major upgrade.
In addition, the price is half of the original price, the speed limit is increased by 2-3 times, the output speed is increased by 2 times, and the delay is reduced to one third of the original price.
Developers can access it for free through Google AI Studio and Gemini API. The chat version will have to wait.
However, some netizens also found the bright spot. Although his mathematical ability is very strong, he still failed to beat o1-mini and o1 full version (94.8).
Google Gemini 1.5 major upgrade
There are two models updated this time: Gemini-1.5-Pro-002 and Gemini-1.5-Flash-002 .
In summary, the main updates are:
-
For 1.5pro (input and output are both less than 128K), the price reduction is more than 50%.
-
The rate limit is increased by 2-3 times;
-
Output speed increased by 2 times and latency reduced by 3 times;
-
Updated default filter settings.
First, the overall performance is improved, especially in mathematics, long texts, and multimodality.
The performance on MMLU-Pro is improved by about 7%; and in the MATH and HiddenMath (internally retained competition math problem sets) benchmarks, both models have significant improvements of about 20%, with the Pro version surpassing o1-preview (85.5%) with a score of 86.5%.
In addition, there is a 2%-7% improvement in the evaluation of visual understanding and code generation.
Based on developer feedback, both models now feature a cleaner style, with the goal of making these models easier to use and lowering their cost.
For use cases such as summarization, question answering, and extraction, the default output length of the updated models is 5-20% shorter than the previous models.
In terms of price, the 1.5pro input token has been reduced by 64%, the output token has been reduced by 52%, and the incremental cache token has been reduced by 64%, which will take effect on October 1st.
The rate limit has also been increased. The paid rate limit of 1.5 Flash has been increased from 1000RPM to 2000RPM; the rate limit of 1.5 Pro has been increased from 360RPM to 1000RPM.
In addition, the output speed is increased by 2 times and the delay is reduced to one third of the original.
For new models, filters have been switched to optional and are not applied by default.
Finally, there is the Gemini 1.5 Flash-8B experimental version update, which has significant improvements in text and multimodal capabilities.
Netizens tested it out
Some netizens tested it so easily.
He tested the audio transcription function of Gemini 1.5 Flash, which was able to transcribe 13 minutes of audio in 50-60 seconds.
In the test results of multiple audio files, the transcription accuracy is close to 99%. If the audio is clear, the accuracy can reach 100%.
Some netizens tested its visual comprehension ability and it passed it successfully, which had previously stumped many visual models.
However, the most discussed aspect is its improvement in mathematical ability.
However, some netizens said that the mathematical benchmark is useless. It is saturated and pollutes the training data of most models. In the real-world mathematical problems, these are still not comparable to the O1 series.
However, there is another use for Google's upgraded model.
That is to push OpenAI to release a new model as soon as possible to "regain the crown."
When will the full version of o1 be released? (Doge)
Reference link:
[1]https://developers.googleblog.com/en/updated-production-ready-gemini-models-reduced-15-pro-pricing-increased-rate-limits-and-more/
[2]https ://www.reddit.com/r/singularity/comments/1fohi2z/gemini_15_002_beats_o1preview_on_math_and_it_does/
-over-
In the selection
「2024 Artificial Intelligence Annual Selection」
The registration channel for the QuantumBit 2024 Artificial Intelligence Annual Awards has been opened. The awards have been divided into five categories based on the three dimensions of Enterprise , Person , and Product .
Welcome to scan the QR code to sign up for the selection! The selection results will be announced at the MEET2025 Smart Future Conference in December . We look forward to witnessing the honorary moment with millions of practitioners.
Click here ???? Follow me, remember to mark the star~