Yiming Huang

I'm a first-year PhD student in Computer and Information Science at the University of Pennsylvania, affiliated with GRASP Lab. I am grateful to be advised by Prof. Lingjie Liu.

My interests revolve broadly around the fields of vision, language, and human-robot interaction with a focus on the perception of human behavior in 3D environments.

I received my master's degree in Robotics from GRASP Lab, University of Pennsylvania, where I had the honor to work with Prof. Jianbo Shi and Prof. Mark Yatskar. I received my bachelor's degree in Mathematics and Computer Science from NYU Shanghai.

profile photo

Email  /  GitHub  /  Google Scholar  /  LinkedIn

profile photo profile photo

Research

(* indicates equal contribution)

project image

CoMo: Controllable Motion Generation through Language Guided Pose Code Editing


Yiming Huang, Weilin Wan, Yue Yang, Chris Callison-Burch, Mark Yatskar, Lingjie Liu,
ECCV 2024
arxiv / code / website /

We present CoMo, a unified framework for fine-grained, text-driven human motion generation and editing using discrete and semantically meaningful pose codes.

project image

Ego-Exo4D: Understanding Skilled Human Activity from First-and Third-Person Perspectives


Kristen Grauman, Andrew Westbury, (et al., including Yiming Huang)
CVPR 2024 (Oral)
arxiv / video / website /

We present Ego-Exo4D, a diverse, large-scale multimodal multiview video dataset and benchmark challenge, centered around simultaneously-captured egocentric and exocentric video of skilled human activities (e.g., sports, music, dance, bike repair).

project image

DiffusionPhase: Motion Diffusion in Frequency Domain


Weilin Wan* , Yiming Huang*, Shutong Wu, Taku Komura, Wenping Wang, Dinesh Jayaraman, Lingjie Liu
arXiv 2023
arxiv / video / code / website /

We propose a network encoder that converts motion sequences into periodic signals and a conditional diffusion model for predicting periodic motion parameters based on text descriptions and the starting pose, enabling the generation of a broader variety of high-quality longer motion sequences.

project image

Investigating Fiducial Marker Network Characteristics for Mobile Indoor Robot Navigation


Yiming Huang, Bharadwaj R. K. Mantha, Borja Garcia de Soto
NYU Shanghai Undergraduate Research Symposium , 2020
poster /

Implemented and evaluated a protocol for optimizing April Tag placement during automatic marker-based navigation of a Turtlebot3 Waffle Pi platform in Gazebo simulation.




Projects

project image

SICK TiM$10K University Challenge 2022


University of Pennsylvania
12/2022 - 04/2024

Sparse 3D Reconstruction Module with TIM8xx 2D LiDAR, programmed in Python and Lua.

project image

Recovery RL for Quadroter Navigation


UPenn ESE 650: Learning in Robotics
02/2023 - 04/2023
code /

Extension of Recovery RL: Safe Reinforcement Learning with Learned Recovery Zones” by Thananjeyan et. al to 3D Quadroter navigation tasks.

project image

Quadroter Maze Navigation


UPenn MEAM 620: Advanced Robotics
01/2023 - 04/2023

Implemented a PD Controller, A* Trajectory Planner, min-jerk trajectory generator and VIO state estimator for navigating a CrazyFlie 2.0 quadroter in maze environments.

project image

Robot Arm Block Stacking Challenge


UPenn MEAM 520: Intro to Robotics
09/2022 - 12/2022

Developed manipulation and planning strategies to control a 7 DOF Franka-Emika Panda manipulator arm with vision sensors for April Tag detection to compete in a block stacking challenge.

project image

PIC Math Program 2020


New York University and Metropolitan Transit Authority New York City Transit
01/2020 - 05/2020
code / slides /

Graph-based Automatic Task Assignment System for Bus Fare Evasion Data Collection.

project image

Bittle Path Finding


Petoi Bittle Project
07/2021 - 08/2021
code /

Path finding and navigation algorithms in a grid environment for the Petoi Robot Dog Bittle

Academic Service

  • Workshop Reviewer: ECCV 2024 Wild3D
  • Teaching

    Graduate Teaching Assistant at the University of Pennsylvania:

  • Fall 2023, Spring 2023, Fall 2022: CIT5900 Programming Languages and Techniques
  • Fall 2023, Spring 2023: CIS4190/5190 Applied Machine Learning
  • Summer 2023: ESE 5410 Machine Learning for Data Science
  • Course Development at the University of Pennsylvania:

  • EAS 5740 How to Use Data
  • Undergraduate Teaching Assistant at NYU Shanghai

  • Summer 2022, Spring 2019: CSCI-SHU 101 Intro to Computer Science
  • Spring 2022: CSCI-SHU 220 Algorithms
  • Spring 2022, Fall 2021: CSCI-SHU 2314 Discrete Mathematics
  • Fall 2021, Fall 2018: CSCI-UA 9002/CSCI-SHU 11 Intro to Computer Programming
  • Spring 2021: CENG-SHU 202 Computer Architecture
  • Fall 2020: MATH-SHU 235 Probability and Statistics
  • Other:

    2021-2022: Fablab O Shanghai STEAM Instructor

    Misc

    I enjoy doodling! doodle

    I'm also a member of the Penn Enchord Acapella Group, check out some of our performances!

    Here's a small stash of some of my acapella arrangments :)


    Sourced from this and this template.