The British journal Nature has published an artificial intelligence research result: a US team reports a family of reinforcement learning algorithms that remember previously visited states and return to them, which lets them solve complex tasks. The approach substantially improves how complex environments are explored and is expected to find applications in robotics, language understanding, and drug design. The algorithms are collectively called "Go-Explore", and they have already outscored human players and state-of-the-art artificial intelligence systems on classic video game benchmarks. The result is regarded as an important step toward truly intelligent learning agents.
Reinforcement learning lets an artificial intelligence system learn to make decisions by exploring a complex environment and working out how best to obtain rewards. A reward might be given when a robot reaches a certain location or when a player reaches a certain level in a computer game. However, in complex environments that provide little feedback, current reinforcement learning algorithms easily get stuck, a long-standing frustration for artificial intelligence researchers.
OpenAI is an artificial intelligence non-profit organization jointly established by a number of Silicon Valley figures. Its founders include Sam Altman, president of the US startup incubator Y Combinator, and Elon Musk, founder of Space Exploration Technologies (SpaceX). Its stated goal is to prevent catastrophic outcomes from artificial intelligence and to promote its positive uses. In this work, OpenAI scientists Adrien Ecoffet, Joost Huizinga and their colleagues identified two major obstacles to effective exploration: an algorithm can forget how to get back to states it has already visited ("detachment"), and it can fail to return to a promising state before exploring onward from it ("derailment"). They designed a family of algorithms to overcome both.
The researchers said that "Go-Explore" explores the environment thoroughly while building an archive of the places it has been, so that it does not forget the route to a promising intermediate state or to the final victory (reward). Its scores on classic Atari games exceeded those of human players and state-of-the-art artificial intelligence systems. The researchers used the algorithms to solve the games on the Atari 2600 console that had not been solved before, demonstrating the potential of the approach. "Go-Explore" scored four times the previous best on the benchmark game "Montezuma's Revenge", and it also beat the average human player on another benchmark game, "Pitfall!", on which previous algorithms had failed to score any points.
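The archive-and-return idea described above can be illustrated with a minimal sketch. It is not the authors' implementation: the corridor environment `ChainEnv`, the function `go_explore`, the `restore` method (which assumes a deterministic simulator whose state can be reloaded directly) and the visit-count selection weight are all hypothetical choices made for this example.

```python
import random


class ChainEnv:
    """Hypothetical toy task: a corridor whose only reward is at the far end."""

    def __init__(self, length=30):
        self.length = length
        self.pos = 0

    def reset(self):
        self.pos = 0
        return self.pos

    def restore(self, state):
        # Assumption: the simulator is deterministic and can be reloaded directly.
        self.pos = state
        return self.pos

    def step(self, action):
        # action is -1 (left) or +1 (right); positions are clamped to the corridor.
        self.pos = max(0, min(self.length, self.pos + action))
        done = self.pos == self.length
        return self.pos, (1.0 if done else 0.0), done


def go_explore(env, iterations=2000, explore_steps=5, seed=0):
    rng = random.Random(seed)
    start = env.reset()
    # Archive: visited state -> action sequence that reaches it, plus a visit count.
    archive = {start: {"actions": [], "visits": 0}}
    best = None
    for _ in range(iterations):
        # Select a state to return to, favoring rarely chosen ones (illustrative weight).
        cells = list(archive)
        weights = [1.0 / (1 + archive[c]["visits"]) ** 0.5 for c in cells]
        cell = rng.choices(cells, weights=weights, k=1)[0]
        archive[cell]["visits"] += 1
        # First return to the selected state ...
        env.restore(cell)
        actions = list(archive[cell]["actions"])
        # ... then explore from it with random actions.
        for _ in range(explore_steps):
            action = rng.choice((-1, 1))
            state, reward, done = env.step(action)
            actions.append(action)
            if done:
                # Keep the shortest rewarded trajectory found so far.
                if best is None or len(actions) < len(best):
                    best = list(actions)
                break
            known = archive.get(state)
            if known is None:
                archive[state] = {"actions": list(actions), "visits": 0}
            elif len(actions) < len(known["actions"]):
                # A shorter route to a known state replaces the stored one.
                known["actions"] = list(actions)
    return archive, best


archive, best = go_explore(ChainEnv(30))
print(len(archive), None if best is None else len(best))
```

Because every stored action sequence starts from the initial state, the returned `best` list can be replayed from `reset()` to reach the reward again. A purely random policy in the same corridor has to stumble on the whole route in one attempt, which is the sparse-feedback difficulty the article describes.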
"Go-Explore" was also able to complete a simulated robotics task in which it had to use a robotic arm to pick up an object and place it on one of four shelves, two of which were behind doors.
The researchers note that the simple principle of remembering and returning to promising areas of exploration is a powerful, general approach to exploration, and they believe their new algorithm has potential applications in robotics, language understanding, and drug design.