aiSim5 expands simulation testing scope based on generative AI (final)-EEWORLD

Collect

In the previous chapters, we discussed the confidence of aiSim simulation synthetic data. In addition, in the process of scene reconstruction and closed-loop testing, we will inevitably face problems such as time-consuming and high-cost 3D scene production and reconstruction, low scalability, and unsatisfactory traffic conditions. The current main challenge is how to automatically generate 3D static scenes and add dynamic instance editing, thereby effectively shortening the test process and expanding the scope of simulation testing.

wKgaomZNiUGACH0zAABL0_0fsNQ18.jpeg?source=d16d100b

Figure 1: Actual image wKgaomZNiUGABH0wAAA-X-ccnfA280.jpg?source=d16d100b

Figure 2: NeRF reconstruction scene

For 3D reconstruction, the two main solutions are NeRF and 3DGS.

1. NeRF

1. Neural Radiance Fields

NeRF encodes the color and density information of each point in three-dimensional space into a continuous function and parameterizes it by MLP. Given a viewpoint and a point in three-dimensional space, NeRF can predict the color of the point and the density distribution along the line of sight. By performing volume rendering on this information, NeRF can synthesize images from new viewpoints.

wKgZomZNiUWAZUeYARW_QeIaDmI089.gif?source=d16d100b

2. Advantages

High-fidelity output.

Based on NerFStudio, a more friendly code library is provided.
Relatively fast training time.
It is scalable for areas to be rebuilt.

3. Shortcomings and main challenges

Slow rendering. NeRF needs to perform a lot of sampling and calculations along each ray from the camera to the scene to accurately estimate the volume density and color of the scene. This process is computationally intensive, and it takes about 10 seconds to render an image at full HD resolution on an NVIDIA A100.

The scene depth estimation effect is not ideal. NeRF implicitly learns the depth information of the scene through volume rendering, but this depth information is usually coupled with the color and density information of the scene. This means that if there are complex situations such as occlusion or non-Lambertian reflection in the scene, NeRF may have difficulty accurately estimating the depth of each pixel.

wKgaomZNiUKAGoKzAAHdHVYkELY10.jpeg?source=d16d100b

Reconstruction quality of close objects may be low. This may be caused by insufficient view angle and resolution, inaccurate depth estimation, and motion blur occlusion.

wKgZomZNiUGAL9jqAAAyc4PLlrM00.jpeg?source=d16d100b

Ghosting artifacts caused by imperfect calibration of high FOV cameras.

wKgZomZNiUKAUsgGAAA9O87Ifug74.jpeg?source=d16d100b

Of course, in order to solve these problems, researchers introduced depth regularization to improve the accuracy and stability of NeRF depth estimation, and improved the rendering speed by optimizing the structure and algorithm of NeRF.

2.3DGS

1. 3D Gaussian Splatting

3DGS uses three-dimensional Gaussian distribution to represent point cloud data in the scene. Each point is described by a Gaussian function with mean and covariance. The Gaussian function is rendered by rasterization to generate realistic 3D scene images.

wKgaomZNiUaAfO9xARF-jfd6tiw046.gif?source=d16d100b

2. Advantages

The training time is short.

Near real-time rendering.

Provides high-fidelity output.

3. Shortcomings and main challenges

The code base is less friendly. Compared with NeRFStudio, the documentation is less complete and easier to use.

The initial point cloud acquisition has high requirements, requiring precise sensors and complex data processing procedures, otherwise it will have a significant impact on the performance of 3DGS.

wKgaomZNiUKAWR4mAABabi0v3BU54.jpeg?source=d16d100b

Depth estimation is also insufficient, which may be due to several reasons: the optimization process tends to optimize each Gaussian point independently, resulting in overfitting in a small number of images; the lack of global geometric information leads to inaccurate depth estimation in large scenes or when reconstructing complex geometric structures; the depth information of the initial point cloud is not accurate enough, etc.

wKgZomZNiUKAF8BbAAAtPgDvA8Y59.jpeg?source=d16d100b

Camera model support is limited. Currently, 3DGS mainly supports the pinhole camera model. Although 3DGS versions of other camera models can be derived in theory, subsequent experiments are needed to verify their effectiveness and accuracy.

The scalability of the reconstruction area is limited, mainly due to the incomplete reconstruction caused by the lack of geometric information outside the LiDAR coverage area and the large amount of computation required to reconstruct large urban scenes.

wKgZomZNiUKAMjSwAABg5f2-Rmc20.jpeg?source=d16d100b

Integration and resource-intensive challenges,Currently 3DGS integration usually relies on Python interfaces; 3DGS may occupy a large amount of VRAM when running.

Optimizing hyperparameters and adopting new methods, such as Scaffold-GS, may help reduce memory requirements and improve processing capabilities on large scenes.

3. Operation method

1. Training process

Step 1: Input - Camera video data; Vehicle motion data; Calibration data; LiDAR point cloud data for depth regularization;

Step 2: Remove dynamic objects: Create segmentation maps to identify and mask different objects and regions in the image; automatically annotate dynamic objects* (Kangmo aiData toolchain);

wKgaomZNiUKAPHj0AABR3iDpRRw00.jpeg?source=d16d100b

Step 3: Perform NeRF or Gaussian splatting.

NeRF:

Any camera model can be used, the example uses the MEI camera model;

Large-scale reconstruction using Block-NeRF;

Embedded in different climate conditions.

Gaussian splatting:

Convert the input camera into a pinhole camera model;

The initial point cloud can be obtained from COLMAP or LiDAR;

Large-scale reconstruction using Block-Splatting.

2. Add dynamic objects

After NeRF and 3DGS generate static scenes, aiSim5 will further add dynamic elements based on the external rendering API, which can not only reconstruct the original scene but also construct different traffic conditions according to test requirements.

wKgZomZNiUKAfGjJAAGn0IU-HmY664.png?source=d16d100b

NeRF/3DGS-based scene details in aiSim5.

wKgaomZNiUKAcihmAABPlQRBgdM82.jpeg?source=d16d100b

Figure 13: Mesh casting shadows wKgaomZNiUOAVX_KAABfuuIWSE477.jpeg?source=d16d100b

Figure 14: Ambient occlusion under the car

3. Effect display

After adding dynamic objects in aiSim5, you can freely change the traffic status in the map scene for SiL/HiL testing of perception/regulation systems.

wKgaomZNiUuAb-xcAdsV8tb-Pgg460.gif?source=d16d100b

Figure 15: aiSim5 running NeRF city scene 1 wKgZomZNiUiAR4gGAba78GiRSyk276.gif?source=d16d100b

Figure 16: aiSim5 running NeRF city scene 2

Reference address：aiSim5 expands simulation testing scope based on generative AI (final)

Previous article：What is the function of a chopper circuit? What are its applications in automobiles?
Next article：Simcenter Vehicle Energy Management Solutions – Accelerate innovation using virtual prototyping

Recommended ReadingLatest update time:2024-11-16 09:57

Advantech and NVIDIA deepen cooperation and become the global distributor of NVIDIA AI Enterprise software

Taipei, April 2024 - Advantech announced that it has expanded its collaboration with NVIDIA to become the first company in Taiwan to receive NVIDIA AI Enterprise certification for its software platform for developing and deploying production-grade AI applications, including generative AI. The recently released N

[Industrial Control]

Advantech and NVIDIA deepen cooperation and become the global distributor of NVIDIA AI Enterprise software

A design scheme for a full-time AI TV system based on voice interaction application

Natural human-computer interaction based on human-centered body, posture, action, expression, and voice recognition technology has begun to be applied to smart TVs . Voice can access and exchange information in the most natural and direct way. Voice information input, recognition, and implementation can greatly redu

[Embedded]

A design scheme for a full-time AI TV system based on voice interaction application

How does AI drive robotics innovation? Listen to what ABB has to say!

With the development of artificial intelligence technology, industrial robots are accelerating their evolution towards autonomy, intelligence and collaboration. As a pioneer in the field of global industrial robots , ABB has identified the trend, made bold innovations, and made in-depth layouts in mu

[robot]

Shanghai AI customized bus is here, how to ride it?

AI is penetrating into all walks of life. Using AI to customize public transportation is one of them. Shanghai's AI customized buses are already on the road. Cheaper than taking a taxi, on time, with seats, and directly from home to the office, would you like to take such a bus? Starting from September 16, Shanghai

[Embedded]

Gartner: By 2026, 30% of enterprises will be unable to provide reliable isolation due to AI-generated deepfakes

Gartner: It is estimated that by 2026, 30% of enterprises will believe that identity authentication and verification solutions cannot provide reliable isolation due to AI-generated deepfakes. Gartner predicts that by 2026, 30% of enterprises will believe that such authentication and verification solutions are no lon

[Industrial Control]

NVIDIA's chip power consumption control system based on artificial intelligence technology

3D graphics cards began their first transformation in 1999 when NVIDIA launched GeForce 256. This hardware light and shadow conversion technology greatly improved the computer's image display performance, had an important revolutionary significance in the field of gaming, and also brought users a very high sense of pi

[Mobile phone portable]

NVIDIA's chip power consumption control system based on artificial intelligence technology

The impact of artificial intelligence (AI) on the automotive industry

First, a few points about Artificial Intelligence (AI): It is a wrong name! AI is neither artificial nor intelligent. AI cannot recognize things without a lot of human training. AI exhibits completely different logic than humans in recognizing, understanding and classifying objects or scenes. The label implies that AI

[Embedded]

The impact of artificial intelligence (AI) on the automotive industry

Intel and the industry create a full-stack AI solution to accelerate the development of commercial AI PCs

March 26, 2024, Beijing - Today, Intel held the "2024 New Intel Commercial Client AI PC Product Launch Conference", extending the AI features based on Intel® Core™ Ultra processors to the commercial field, bringing innovations in commercial computer technology. At the scene, Intel shared the products and i

[Network Communication]

Intel and the industry create a full-stack AI solution to accelerate the development of commercial AI PCs

Popular Resources
Popular amplifiers