Over the past decade, quadcopter drones have evolved dramatically. Today, they are widely used in all areas of life, from government inspections and accident mapping to filming, express delivery, and search and rescue.
At least from a commercial or research perspective, you can buy a good drone for just a few thousand dollars, and all you need to do is teach it to fly. But if you want a quadcopter to achieve stable takeoff, you usually need to spend a lot of time on parameter debugging, program development, and flight tests.
Recently, researchers from New York University and TII (Agile Robotics and Perception Laboratory and Technology Innovation Institute) provided new ideas for simplifying the flight process of drones, and successfully sent robots into the sky through rapid simulation on consumer-grade laptops.
By applying deep reinforcement learning and simulation technology, the system they developed can teach a quadcopter drone that has never flown how to take off and hover stably in just 18 seconds. Not only is the takeoff time short, but the hardware requirements for the entire process are also very low: a MacBook Pro can do it.
Deep reinforcement learning, “learning without a teacher”
If you want a computer to independently control complex quadcopter flight, the key is to implement algorithms for autonomous learning and decision-making.
What is reinforcement learning? Simply put, it is a training method that interacts with the environment. The algorithm tries different actions and adjusts its strategy based on the rewards or penalties brought by the actions to maximize the accumulated rewards during the training process.
Researchers do not need to define a complex control model in advance, or manually specify all the characteristics and parameters required for flight. The algorithm will autonomously explore the optimal strategy, allowing the quadrotor to learn without a teacher.
Simulation environment, safe and efficient
The most direct way to teach a quadcopter to fly is to test it in real life and adjust the algorithm based on the flight results. However, the risks and costs of doing so are extremely high. First, it requires a lot of money, and second, each test flight and crash is a loss of equipment.
The solution proposed by the New York University research team is to build a digital simulation platform that is extremely realistic with the real environment.
In this simulated world, the quadcopter can take off, hover, and land continuously, and the drone can obtain status information and make decisions just like in reality. In the event of an accident, you only need to reset the system without worrying about equipment damage.
Such an efficient and secure training environment provides great convenience for algorithm iteration. The team also took advantage of the hardware acceleration provided by Apple M series processors to further shorten the training time and greatly improve the simulation efficiency.
Course-driven, effectively avoiding overfitting
In addition to deep reinforcement learning and simulation technology, the "curriculum-driven" strategy designed in this study also played an important role in rapid training.
The so-called curriculum-driven approach is to dynamically adjust the reward mechanism according to different stages of the training process. At the beginning, more behaviors are rewarded to encourage the algorithm to explore sufficiently. In the later stage, only the optimization goals are rewarded, and the penalties are gradually increased to emphasize robustness and reliability.
The benefit of this approach is that it can effectively avoid overfitting to a specific environment. Some systems perform well in complex simulations, but fail in real scenarios because they rely too much on certain simulation parameters. Dynamically adjusting the reward target can encourage the algorithm to focus on the truly important performance indicators.
And within 18 seconds, not only does the take-off take off, in fact the quadcopter can already maintain a stable hover, avoid nudges and fly along a specific trajectory.
18 seconds, just the beginning
Although the number of 18 seconds is amazing, you may wonder: Is training completed in just 18 seconds really enough to cope with various unknown situations in complex environments?
Indeed, there are still some limitations. For example, the existing training is still too idealistic and cannot cover all situations, such as the response ability under different load conditions and the impact of different ambient light on vision. The training samples still need to be further expanded.
But looking to the future, 18 seconds is just the beginning. Perhaps the next 18 seconds will bring more unpredictable surprises.
Previous article:[Veco Cup] Source Control Electronics participated in the "Veco Cup OFweek 2023 China Robotics Industry Annual Core Components Innovation Product Award"
Next article:Orders dropped by 30%?
- Popular Resources
- Popular amplifiers
- Using IMU to enhance robot positioning: a fundamental technology for accurate navigation
- Researchers develop self-learning robot that can clean washbasins like humans
- Universal Robots launches UR AI Accelerator to inject new AI power into collaborative robots
- The first batch of national standards for embodied intelligence of humanoid robots were released: divided into 4 levels according to limb movement, upper limb operation, etc.
- New chapter in payload: Universal Robots’ new generation UR20 and UR30 have upgraded performance
- Humanoid robots drive the demand for frameless torque motors, and manufacturers are actively deploying
- MiR Launches New Fleet Management Software MiR Fleet Enterprise, Setting New Standards in Scalability and Cybersecurity for Autonomous Mobile Robots
- Nidec Drive Technology produces harmonic reducers for the first time in China, growing together with the Chinese robotics industry
- DC motor driver chip, low voltage, high current, single full-bridge driver - Ruimeng MS31211
- Innolux's intelligent steer-by-wire solution makes cars smarter and safer
- 8051 MCU - Parity Check
- How to efficiently balance the sensitivity of tactile sensing interfaces
- What should I do if the servo motor shakes? What causes the servo motor to shake quickly?
- 【Brushless Motor】Analysis of three-phase BLDC motor and sharing of two popular development boards
- Midea Industrial Technology's subsidiaries Clou Electronics and Hekang New Energy jointly appeared at the Munich Battery Energy Storage Exhibition and Solar Energy Exhibition
- Guoxin Sichen | Application of ferroelectric memory PB85RS2MC in power battery management, with a capacity of 2M
- Analysis of common faults of frequency converter
- In a head-on competition with Qualcomm, what kind of cockpit products has Intel come up with?
- Dalian Rongke's all-vanadium liquid flow battery energy storage equipment industrialization project has entered the sprint stage before production
- Allegro MicroSystems Introduces Advanced Magnetic and Inductive Position Sensing Solutions at Electronica 2024
- Car key in the left hand, liveness detection radar in the right hand, UWB is imperative for cars!
- After a decade of rapid development, domestic CIS has entered the market
- Aegis Dagger Battery + Thor EM-i Super Hybrid, Geely New Energy has thrown out two "king bombs"
- A brief discussion on functional safety - fault, error, and failure
- In the smart car 2.0 cycle, these core industry chains are facing major opportunities!
- The United States and Japan are developing new batteries. CATL faces challenges? How should China's new energy battery industry respond?
- Murata launches high-precision 6-axis inertial sensor for automobiles
- Ford patents pre-charge alarm to help save costs and respond to emergencies
- New real-time microcontroller system from Texas Instruments enables smarter processing in automotive and industrial applications
- [TI mmWave Radar Evaluation] 5+AWR1443 EVM Board Sensitivity (Minimum Distance) Test and Summary
- 【ESP32-S2-Kaluga-1 Review】Getting weather information via https
- LE audio, Bluetooth positioning, Bluetooth mesh... Where will the next technological explosion point of Bluetooth be? ?
- RAM is equivalent to the computer's running memory. Can the memory stick also be expanded?
- LPS25HB Barometric Pressure Sensor PCB Package and Code
- Can the glass fiber effect be quantified through simulation?
- In Cadence, I want to reserve space for components in PCB design. How do I do it in the principle?
- Circuit Schematic Analysis
- Does anyone know about the FDS9435 chip?
- Button control water flow light: press key1, LED1-8 will flow in sequence, press key2, LED1/3/5/7 (4 intervals)...