I hold an MSc in Robotics from the Swiss Federal Institute of Technology Lausanne (EPFL), where I completed my thesis at ETH Zurich under Prof. Marco Hutter and Prof. Caglar Gulcehre of EPFL. My interests include robotics perception, reasoning, and actuation, with a focus on 3D vision, robot control/learning, and multimodal learning.
I had the pleasure of working in the labs of Prof. Huijing Zhao at Peking University on 3D perception, Prof. Min Xu at Carnegie Mellon University on generative models for Cryo-ET, and Prof. Mathieu Salzmann at EPFL on neural network quantization for object detection and pose estimation.
I was a research intern at ByteDance AI Lab, working on multimodal representation learning, and a student researcher at Google DeepMind, working on extending a Vision-Language Model with an ink modality.
I enjoy board games 🎲, soccer ⚽, tennis 🎾, and music 🎶. Feel free to reach out to me if you want to join in on a hike or play some board games.
Robotics, 2021 - 2024
Swiss Federal Institute of Technology Lausanne
Robotics Summer School, July 2022
Swiss Federal Institute of Technology Zurich
Summer Session, 2018
University of California, Berkeley
[Nov 2024] 🎉 Excited to share that our paper MQAT (Modular Quantization-Aware Training for 6D Object Pose Estimation) has been accepted to Transactions on Machine Learning Research [link].
[Oct 2024] 🎉 Excited to share that our project, InkSight, has been featured across multiple platforms: Google Research Blog, LinkedIn, X post, Hugging Face (AK’s post), and Hacker News.
[Sept 2024] 🎉 Successfully defended my thesis with the grade of 6.0/6.0 at both ETH Zurich and EPFL. Grateful to my supervisors and committee members for their invaluable feedback and support!
[April 2024] I began my master’s thesis at the Robotic Systems Lab and CLAIRE Lab.
[Feb 2024] 🌟 Completed my student researcher internship at Google Research. It was an incredible experience, full of learning and collaboration with a fantastic team.
Download full CV
Thesis Title: Continuous Skill Learning For ANYmal Robot
Supervisors: Chenhao Li, Nikita Rudin, Skander Moalla*, Marco Hutter, Caglar Gulcehre* (hosting supervisors from CLAIRE, EPFL)
Edge applications, such as collaborative robotics and spacecraft rendezvous, demand efficient 6D object pose estimation on resource-constrained embedded platforms. Existing 6D object pose estimation networks are often too large for such deployments, necessitating compression while maintaining reliable performance. To address this challenge, we introduce Modular Quantization-Aware Training (MQAT), an adaptive and mixed-precision quantization-aware training strategy that exploits the modular structure of modern 6D object pose estimation architectures. MQAT guides a systematic gradated modular quantization sequence and determines module-specific bit precisions, leading to quantized models that outperform those produced by state-of-the-art uniform and mixed-precision quantization techniques. Our experiments showcase the generality of MQAT across datasets, architectures, and quantization algorithms. Additionally, we observe that MQAT quantized models can achieve an accuracy boost (>7% ADI-0.1d) over the baseline full-precision network while reducing model size by a factor of 4 or more.
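To give a feel for the core operation, here is a minimal NumPy sketch of generic symmetric fake quantization with per-module bit widths. This is an illustration of mixed-precision quantization-aware training in general, not MQAT's implementation; the module names and bit assignments are purely hypothetical.

```python
import numpy as np

def fake_quantize(w, bits):
    # Symmetric uniform fake quantization: quantize then dequantize,
    # so training sees the precision loss while staying in float.
    qmax = 2 ** (bits - 1) - 1
    max_abs = np.max(np.abs(w))
    scale = max_abs / qmax if max_abs > 0 else 1.0
    q = np.clip(np.round(w / scale), -qmax - 1, qmax)
    return q * scale

# Hypothetical per-module bit plan: a backbone tolerating 4 bits,
# a pose head kept at 8 bits (illustrative numbers only).
rng = np.random.default_rng(0)
weights = {"backbone": rng.standard_normal(64), "head": rng.standard_normal(16)}
bit_plan = {"backbone": 4, "head": 8}
quantized = {name: fake_quantize(w, bit_plan[name]) for name, w in weights.items()}
```

In a quantization-aware training loop, the forward pass would use `quantized` while gradients update the underlying full-precision `weights` (the straight-through estimator); module-wise bit plans like `bit_plan` are what a mixed-precision scheme searches over.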
Our work aims to bridge the gap between images of handwriting and digital ink with a Vision Language Model (PaLI). To our knowledge, this is the first work that effectively does so with arbitrary photos with diverse visual characteristics and backgrounds. Furthermore, it generalizes beyond its training domain and can work on simple sketches. Human evaluation reveals that 87% of the samples produced by our model on the challenging HierText dataset are considered valid tracings of the input image, and 67% look like pen trajectories traced by a human.
This project investigates the effects of Sharpness-Aware Minimization (SAM) and Adaptive Sharpness-Aware Minimization (ASAM) on model generalization. Our experiments demonstrate that sharpness-aware optimization techniques notably enhance generalization. In particular, ASAM shows promise in improving performance on un-normalized data.
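For context, the SAM update can be sketched in a few lines: ascend to the worst-case point within a small L2 ball around the weights, then descend using the gradient taken there. This is a minimal NumPy illustration on a toy quadratic loss, not the project's code; the learning rate and `rho` values are arbitrary.

```python
import numpy as np

def sam_step(w, grad_fn, lr=0.1, rho=0.05):
    # 1) Ascent: perturb weights toward the sharpness maximum
    #    within an L2 ball of radius rho.
    g = grad_fn(w)
    eps = rho * g / (np.linalg.norm(g) + 1e-12)
    # 2) Descent: update the original weights with the gradient
    #    evaluated at the perturbed point.
    g_sharp = grad_fn(w + eps)
    return w - lr * g_sharp

# Toy loss L(w) = 0.5 * ||w||^2, whose gradient is simply w.
grad_fn = lambda w: w
w = np.array([3.0, -4.0])
for _ in range(100):
    w = sam_step(w, grad_fn)
```

ASAM differs by scaling the perturbation radius per parameter (adaptive sharpness), which makes the ascent step invariant to weight rescaling.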
Some of my projects, enjoy!
autograd