publications

Academic publications on machine learning systems, on-device AI, and the design of high-performance inference engines.

2025

  1. MNN-AECS: Energy Optimization for LLM Decoding on Mobile Devices via Adaptive Core Selection
    Zhengxiang Huang, Chaoyue Niu, Zhaode Wang, and 8 more authors
    2025
  2. MadaKV: Adaptive Modality-Perception KV Cache Eviction for Efficient Multimodal Long-Context Inference
    Kunxi Li, Zhonghua Jiang, Zhouzhou Shen, and 5 more authors
    2025

2024

  1. MNN-LLM: A Generic Inference Engine for Fast Large Language Model Deployment on Mobile Devices
    Zhaode Wang, Jingbang Yang, Xinyu Qian, and 4 more authors
    In Proceedings of the 6th ACM International Conference on Multimedia in Asia Workshops, Dec 2024

2022

  1. Walle: An End-to-End, General-Purpose, and Large-Scale Production System for Device-Cloud Collaborative Machine Learning
    Chengfei Lv, Chaoyue Niu, Renjie Gu, and 17 more authors
    Dec 2022