About Me

Hi, I’m Zhenxiang (Roy) Jiang, an Applied Scientist/AI Engineer/Researcher.

With over three years of experience in deep learning R&D and deployment, my work spans a wide range of artificial intelligence tasks, from computer vision to large language model.

Portfolio

CV/Resume

Research & Development Areas

Video Agent
Image/Video Understanding
Large language model
Image generation & editing
Detection and classification
Human-related tasks
3D/4D dynamic scene reconstruction
Explainable/trustworthy AI

Professional Experience

OpusClip

AI Engineer/Applied Scientist
Direct Manager: Vito Zhu
Palo Alto, California, US | May 2025 - Present

Contributing to the development of Agent Opus, which is an AI video agent that turns your ideas in any form into polished videos.
Focusing on integrating real-world assets, including images, webpages, videos, and posts, into video generation pipelines.
Led the design and implementation of core workflows for agent action and evaluation, with a particular emphasis on image and video understanding and their evaluation across different agents.
Designed workflows for atomic template creation, seamlessly integrating real-world assets with image/video generation and editing capabilities.
Researched the ability of integrating the ability of detection, segmentation with LLM-based agent system.

Learning and Vision Lab, ECE Dept., National University of Singapore

Research Assistant
Supervisor: Prof. Xinchao Wang
Singapore | August 2023 – February 2025

Completed a diverse range of computer vision tasks, from low-level image processing to high-level scene understanding.
Co-led a high-resolution non-homogeneous dehazing project that ranked 4th out of 100+ submissions (CVPR Workshop 2023).
Collaborated on an XAI project with Singapore’s largest national defense R&D organization, delivering two phases of product development.
Designed key modules—camera–world coordinate conversion and interactive 3D/4D visualization—for the GFlow and C4D projects, contributing to publications at AAAI 2025 and arXiv.

Temasek Laboratories, National University of Singapore

Research Assistant
Supervisor: Dr. Sunan Huang
Singapore | September 2023 – April 2024

Led research on a high-frequency drone detection module to enhance onboard drone tracking system accuracy.
Built a fully labeled event camera drone detection dataset by integrating multiple drone detection datasets.

Machine Intelligence Lab, College of Computer Science, Sichuan University

Research Assistant
Supervisor: Prof. Yuanyuan Chen
Chengdu, China | March 2022 – June 2023

Initiated research on facial expression recognition under face mask occlusion and developed a seven-class dataset, earning the Best Presentation Award at ACM ICCAI 2023.
Led the development of WS-GCN for weakly supervised 3D human pose estimation, resulting in a publication at ACM ICCAI 2024.

Yinlaiyinwang (Convenient Printing) Technology

Founder, CEO
Chengdu, China | November 2020 – July 2022

Led a team of 10 to develop an intelligent online printing system, converting traditional offline printers into smart, internet-connected devices.
Established an on-campus experience store serving over 100,000 students and creating more than 15 part-time job opportunities.
Received multiple entrepreneurship awards at the college and university levels.

Education

National University of Singapore

Master of Science in Computer Engineering
Specialization: Machine Intelligence and Application
GPA: 4.69/5.00
Singapore | August 2023 – January 2025

Bachelor of Engineering in Artificial Intelligence
GPA: 3.80/4.00 | Top Graduate of Sichuan Province (Top 4%) | Graduated as Valedictorian
Chengdu, Sichuan, China | September 2019 – June 2023

Publications

C4D: 4D Made from 3D through Dual Correspondences
Wang, S., Jiang, Z., Yang, X., & Wang, X.
ICCV 2025 (Accepted)
Paper
GFlow: Recovering 4D World from Monocular Video
Wang, S., Yang, X., Shen, Q., Jiang, Z., & Wang, X.
In Proceedings of the AAAI Conference on Artificial Intelligence (Vol. 39, No. 8, pp. 7862-7870), April 2025
Paper
WS-GCN: Integrating GCN with Weak Supervision for Enhanced 3D Human Pose Estimation
Jiang, Z., Chen, Y.
In Proceedings of the 2024 10th International Conference on Computing and Artificial Intelligence (pp. 6–13), April 2024
ACM Digital Library
NTIRE 2023 HR Nonhomogeneous Dehazing Challenge Report
Ancuti, C. O., …, Wu, Y., Jiang, Z., Liu, S., Yang, X., Jing, Y., … & Busch, C.
In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (pp. 1808–1825), 2023
Paper
A Novel Seven-Class Facial Expression Recognition Method With Face Mask
Jiang, Z.
In Proceedings of the 2023 9th International Conference on Computing and Artificial Intelligence (pp. 178–184), March 2023
ACM Digital Library

Skills

Programming Languages

Python (Advanced)
SQL (Advanced)
C++ (Proficient)
Matlab (Proficient)
CudaC (Proficient)
Java (Intermediate)
Shell Scripting (Intermediate)

Libraries & Frameworks

PyTorch (Advanced)
NumPy (Advanced)
Pandas (Advanced)
Matplotlib (Proficient)
Scikit-Learn (Proficient)
OpenCV (Proficient)
TensorBoard (Familiar)
LaTeX (Familiar)

Tools & Platforms

Linux (Advanced)
MySQL (Advanced)
Git (Advanced)
Docker (Proficient)
FastAPI (Proficient)
Nginx (Familiar)
Vue (Familiar)
GitHub Actions (Familiar)

Languages

English (Fluent)
Mandarin Chinese (Native)