Chengkun Li

Check out my projects

My Blog

Read My Latest Posts

Read

Chengkun (Charlie) Li

PhD Student in Computer Science

Swiss Federal Institute of Technology Lausanne (EPFL)

Brief Bio

I’m a PhD student in Computer Science at , working with Prof. Alexander Mathis in the Mathis Group for Computational Neuroscience and AI. My research focuses on multimodal and continual learning for embodied systems, connecting perception, reasoning, and motor control.

Before my PhD, I completed an MSc in Robotics at EPFL and a master’s thesis at on continual skill learning for ANYmal, supervised by Prof. Marco Hutter and Prof. Caglar Gulcehre. I was also a student researcher at Zurich, where I worked on InkSight, an open-source system for converting images of handwriting into digital ink.

Earlier, I worked at EPFL CVLab on efficient 6D object pose estimation with modular quantization-aware training, at ByteDance AI Lab on multimodal representation learning, and on projects in 3D perception and Cryo-ET generative modeling.

Outside of research, I enjoy board games 🎲, soccer ⚽, tennis 🎾, and music 🎶. Feel free to reach out to me if you want to join in on a hike or play some board games.

Work with me

I’m open to collaborations with doctoral students and motivated Master’s students interested in continual learning, multimodal systems, embodied AI, and computational neuroscience.

If you have a solid deep learning foundation and want to work on these topics, I’d love to chat about potential research projects. EPFL Master’s students should also follow the instructions on our lab website.

Research Interests

Multimodal and Embodied Learning
Continual / Open-Ended Learning
Motor Control, Perception, and Reasoning

Industry Experience

Student Researcher, 2023 - 2024

Google Research
Computer Vision Research Intern, 2021

ByteDance AI Lab

Recent News

All news →

Jun 2026

Award

Winner of the CVPR 2026 OpenSUN3D Challenge

Winner of the Open-Vocabulary 3D Affordance Grounding track at the CVPR 2026 OpenSUN3D Challenge.

Challenge

Apr 2026

Paper

The behavior biopsy published in Current Opinion in Neurobiology

The behavior biopsy: Interpreting animal behavior as embodied, situated, and hierarchical - published in Current Opinion in Neurobiology. Co-first author with Andy Bonnetto and Sepideh Mamooler.

Dec 2025

Award

Won the NeurIPS 2025 MyoChallenge

Towards Human Athletic Intelligence — winning submission at NeurIPS 2025.

Talk

Jun 2025

Paper

InkSight accepted to TMLR

InkSight: Offline-to-Online Handwriting Conversion by Teaching Vision-Language Models to Read and Write — accepted to Transactions on Machine Learning Research.

Feb 2025

Milestone

Started my PhD at EPFL

Featured Publications

Visit my scholar page for full list

Andy Bonnetto*, Sepideh Mamooler*, Chengkun Li*, Alexander Mathis

April 2026 Current Opinion in Neurobiology Computational Neuroscience

The behavior biopsy: Interpreting animal behavior as embodied, situated, and hierarchical

A perspective on behavior as embodied, situated, and hierarchical, and on how physics engines, computer vision, and multimodal language models can support richer quantitative analysis of animal behavior.

DOI

Chengkun Li*, Cheryl Wang*, Bianca Ziliotto, Merkourios Simos, Jozsef Kovecses, Guillaume Durandau, Alexander Mathis

February 2026 Preprint Embodied AI

Towards Embodied AI with MuscleMimic: Unlocking full-body musculoskeletal motor learning at scale

An open-source framework for scalable motion imitation learning with physiologically realistic, muscle-actuated humanoids — two validated musculoskeletal embodiments (126- and 416-muscle) plus GPU-parallel training of generalist motor policies.

PDF Code

^ⓡFirst authors (random order generated by AEA tool), ^†Correspondence, Blagoj Mitrevski^ⓡ, Arina Rak^ⓡ, Julian Schnitzler^ⓡ, Chengkun Li^ⓡ, Andrii Maksai^†, Jesse Berent, Claudiu Musat

June 2025 Transactions of Machine Learning Research

InkSight: Oﬄine-to-Online Handwriting Conversion by Teaching Vision-Language Models to Read and Write

Our work aims to bridge the gap between images of handwriting and digital ink with a Vision Language Model (PaLI). To our knowledge, this is the first work that effectively does so with arbitrary photos with diverse visual characteristics and backgrounds. Furthermore, it generalizes beyond its training domain and can work on simple sketches. Human evaluation reveals that 87% of the samples produced by our model on the challenging HierText dataset are considered valid tracings of the input image, and 67% look like pen trajectories traced by a human.

PDF Code Google Research Blog Dataset Poster Video 🤗 Hugging Face Demo 📦 Model Release

See all publications