Xingzhe He

I'm the technical lead for AI Research at Descript, leading video & audio generation research and multi-modal understanding.

Previously PhD at UBC with Helge Rhodin. Working on controllable generation and self-supervised structure discovery.

Email Scholar GitHub LinkedIn

About

I'm the technical lead for AI Research at Descript, where I lead teams working on video and audio generation research, and on multi-modal understanding that helps creators produce content at the speed of thought.

I completed my PhD at the University of British Columbia, advised by Prof. Helge Rhodin. My research centers on computer vision, machine learning, and generative models — for images, shapes, and physics.

Before UBC, I spent a year at Dartmouth College as a research intern advised by Prof. Bo Zhu, working on physics-based machine learning. I received my M.Sc. from Rutgers University and my B.Sc. from the University of Liverpool / Xi'an Jiaotong-Liverpool University, advised by Prof. Corina Constantinescu.

Productionized Research

Foundational Generative Models for LipSync

A state-of-the-art lipsync generation stack built from scratch, including highly compressed video VAEs and diffusion generators. The system improved audio-visual alignment, identity preservation, and fine facial dynamics across challenging poses. Example videos.

Audio Inpainting Models

A state-of-the-art audio inpainting model built from scratch, including highly compressed audio VAEs and diffusion generators. The system preserves not only identity and talking pace but also room tone, including background noise and music. Example audio clips.

Long-Horizon Video-to-Video Generation

A training-free video-to-video model that scales sequence length for consistency, supporting generation and editing of static-camera videos beyond two hours. Example videos.

Jumpcut Smoothing

A jumpcut smoothing model that fills gaps from trimming or re-timing clips with regenerated frames, so joins flow like continuous takes. Example videos.

Publications

Full list on Google Scholar →

Goodbye Drift: Anchored Tree Sampling for Long-Horizon Video-to-Video Generation

Technical Report

Matthew Bendel, Stephen W. Bailey, Mithilesh Vaidya, Sumukh Badam, Xingzhe He

paper code project

AutoLink: Self-Supervised Learning of Human Skeletons and Object Outlines by Linking Keypoints

NeurIPS 2022 Spotlight · ~3%

Xingzhe He, Bastian Wandt, Helge Rhodin

paper project code demo

Few-Shot Geometry-Aware Keypoint Localization

Xingzhe He, Gaurav Bharaj, David Ferman, Helge Rhodin, Pablo Garrido

paper project video

PoDAR: Power-Disentangled Audio Representation for Generative Modeling

arXiv

Alejandro Luebs, Mithilesh Vaidya, Ishaan Kumar, Sumukh Badam, Stephen W. Bailey, Matthew Bendel, Jose Sotelo, Xingzhe He

paper project

A Data Perspective on Enhanced Identity Preservation for Diffusion Personalization

WACV 2025

Xingzhe He, Zhiwen Cao, Nicholas Kolkin, Lantao Yu, Kun Wan, Helge Rhodin, Ratheesh Kalarot

paper

Unsupervised Keypoints from Pretrained Diffusion Models

CVPR 2024 Highlight · ~12%

Eric Hedlin, Gopal Sharma, Shweta Mahajan, Xingzhe He, Hossam Isack, Abhishek Kar, Helge Rhodin, Andrea Tagliasacchi, Kwang Moo Yi

paper code project

LatentKeypointGAN: Controlling GANs via Latent Keypoints

CRV 2023 Best Paper Award

Xingzhe He, Bastian Wandt, Helge Rhodin

paper videos extended abstract

GANSeg: Learning to Segment by Unsupervised Hierarchical Image Generation

Xingzhe He, Bastian Wandt, Helge Rhodin

paper code

Nonseparable Symplectic Neural Networks

Shiying Xiong, Yunjin Tong, Xingzhe He, Shuqi Yang, Cheng Yang, Bo Zhu

paper supplementary webpage

Learning Physical Constraints with Neural Projections

Shuqi Yang, Xingzhe He, Bo Zhu

paper video webpage

AdvectiveNet: An Eulerian–Lagrangian Fluidic Reservoir for Point Cloud Processing

Xingzhe He, Helen L. Cao, Bo Zhu

paper code

Symplectic Neural Networks in Taylor Series Form for Hamiltonian Systems

Journal of Computational Physics

Yunjin Tong*, Shiying Xiong*, Xingzhe He, Guanghan Pan, Bo Zhu

paper webpage

Soft Multicopter Control using Neural Dynamics Identification

Yitong Deng, Yaorui Zhang, Xingzhe He, Shuqi Yang, Yunjin Tong, Michael Zhang, Daniel M. DiPietro, Bo Zhu

paper video

RoeNets: Predicting Discontinuity of Hyperbolic Systems from Continuous Data

International Journal for Numerical Methods in Engineering

Yunjin Tong, Shiying Xiong, Xingzhe He, Shuqi Yang, Zhecheng Wang, Rui Tao, Runze Liu, Bo Zhu

paper