About Me

I am a Principal Applied Scientist at Amazon AGI, leading the development of large-scale multimodal foundation models in the Nova family. My work spans encoder and multimodal embeddings, M-LLM training and evaluation, with a focus on video, cross-modal reasoning, and unified omni-model architectures.

Previously, I was a Staff Research Scientist at ByteDance and a Senior Applied Scientist at AWS AI, leading multimodal and video modeling efforts deployed in production.
I received my Ph.D. from Rutgers University in 2018 and my B.S. from the University of Electronic Science and Technology of China in 2013.


Updates

Model Release

  • Nova 2 Family: Multimodal reasoning and generation models. Technical Report
  • Nova Multimodal Embedding: State-of-the-art multimodal embeddings for agentic RAG and semantic search across video, image, document, and audio. Technical Report
  • Nova 1 Family: Amazon’s first generation multimodal foundation models. Nova 1 and Nova 1 Premier

Publications