Autonomous vehicles for micro-mobility

Henrik Christensen, David Paz, Hengyuan Zhang, Dominique Meyer, Hao Xiang, Yunhai Han, Yuhan Liu, Andrew Liang, Zheng Zhong, Shiqi Tang

Autonomous Intelligent Systems, 2021, Vol. 1, Issue 1: 11. DOI: 10.1007/s43684-021-00010-2
Original Article



Autonomous vehicles have been envisioned for more than 100 years. One of the first suggestions appeared on a front cover of Scientific American in 1916. Today, it is possible to get cars that drive autonomously for extended distances. We are also starting to see micro-mobility solutions, such as the Nuro vehicles for pizza delivery. Building autonomous cars that can operate in urban environments with a diverse set of road users is far from trivial. In early 2018, the Contextual Robotics Institute at UC San Diego launched an effort to build a full-stack autonomous vehicle for micro-mobility. The motivations were diverse: i) development of a system for operation in an environment with many pedestrians, ii) design of a system that does not rely on dense maps (sometimes called HD maps), iii) design of strategies for building truly robust systems, and iv) creation of a framework to educate next-generation engineers. In this paper, we present the design, prototyping, and evaluation of such a vehicle. Based on the evaluation, we explore several research directions that address the observed shortcomings, and we draw lessons and identify issues for future work.


Autonomous vehicles / Intelligent systems / Micro-mobility

Cite this article

Henrik Christensen, David Paz, Hengyuan Zhang, Dominique Meyer, Hao Xiang, Yunhai Han, Yuhan Liu, Andrew Liang, Zheng Zhong, Shiqi Tang. Autonomous vehicles for micro-mobility. Autonomous Intelligent Systems, 2021, 1(1): 11






