Today, we are even more confident that removing lidar was a good choice. Why? Our new AI system is built on a large language model trained on a large amount of data. The data are mostly short video clips, something like 10 or 30 seconds long, recorded on the road while the customer is driving. Those clips are the training input for the AI system, and that is how XNGP is upgraded. It learns this way, from every car on the road.
The lidar data can’t contribute to the AI system.
Why?
Because the system takes only visual input. We call it VLA – vision, language, action. Lidar data are a different kind of data and can't be absorbed by the AI system. That is why our system improves very, very fast: we can train it on so much road data.