About Me
Hi, I’m Zhenxiang (Roy) Jiang, an Applied Scientist/AI Engineer/Researcher.
I have over three years of experience in deep learning R&D and deployment, and my work spans a wide range of artificial intelligence tasks, from computer vision to large language models.
Research & Development Areas
- Video Agent
- Image/Video Understanding
- Large language models
- Image generation & editing
- Detection and classification
- Human-related tasks
- 3D/4D dynamic scene reconstruction
- Explainable/trustworthy AI
Professional Experience
OpusClip
AI Engineer/Applied Scientist
Direct Manager: Vito Zhu
Palo Alto, California, US | May 2025 – Present
- Contributing to the development of Agent Opus, an AI video agent that turns ideas in any form into polished videos.
- Focusing on integrating real-world assets, including images, webpages, videos, and posts, into video generation pipelines.
- Led the design and implementation of core workflows for agent action and evaluation, with a particular emphasis on image and video understanding and their evaluation across different agents.
- Designed workflows for atomic template creation, seamlessly integrating real-world assets with image/video generation and editing capabilities.
- Researched how to integrate detection and segmentation capabilities into LLM-based agent systems.
Learning and Vision Lab, ECE Dept., National University of Singapore
Research Assistant
Supervisor: Prof. Xinchao Wang
Singapore | August 2023 – February 2025
- Completed a diverse range of computer vision tasks, from low-level image processing to high-level scene understanding.
- Co-led a high-resolution non-homogeneous dehazing project that ranked 4th out of 100+ submissions (CVPR Workshop 2023).
- Collaborated on an XAI project with Singapore’s largest national defense R&D organization, delivering two phases of product development.
- Designed key modules—camera–world coordinate conversion and interactive 3D/4D visualization—for the GFlow and C4D projects, contributing to publications at AAAI 2025 and arXiv.
Temasek Laboratories, National University of Singapore
Research Assistant
Supervisor: Dr. Sunan Huang
Singapore | September 2023 – April 2024
- Led research on a high-frequency drone detection module to improve the accuracy of an onboard drone tracking system.
- Built a fully labeled event camera drone detection dataset by integrating multiple drone detection datasets.
Machine Intelligence Lab, College of Computer Science, Sichuan University
Research Assistant
Supervisor: Prof. Yuanyuan Chen
Chengdu, China | March 2022 – June 2023
- Initiated research on facial expression recognition under face mask occlusion and developed a seven-class dataset, earning the Best Presentation Award at ACM ICCAI 2023.
- Led the development of WS-GCN for weakly supervised 3D human pose estimation, resulting in a publication at ACM ICCAI 2024.
Yinlaiyinwang (Convenient Printing) Technology
Founder, CEO
Chengdu, China | November 2020 – July 2022
- Led a team of 10 to develop an intelligent online printing system, converting traditional offline printers into smart, internet-connected devices.
- Established an on-campus experience store serving over 100,000 students and creating more than 15 part-time job opportunities.
- Received multiple entrepreneurship awards at the college and university levels.
Education
National University of Singapore
Master of Science in Computer Engineering
Specialization: Machine Intelligence and Application
GPA: 4.69/5.00
Singapore | August 2023 – January 2025
Sichuan University
Bachelor of Engineering in Artificial Intelligence
GPA: 3.80/4.00 | Top Graduate of Sichuan Province (Top 4%) | Graduated as Valedictorian
Chengdu, Sichuan, China | September 2019 – June 2023
Publications
C4D: 4D Made from 3D through Dual Correspondences
Wang, S., Jiang, Z., Yang, X., & Wang, X.
ICCV 2025 (Accepted)
GFlow: Recovering 4D World from Monocular Video
Wang, S., Yang, X., Shen, Q., Jiang, Z., & Wang, X.
In Proceedings of the AAAI Conference on Artificial Intelligence (Vol. 39, No. 8, pp. 7862–7870), April 2025
WS-GCN: Integrating GCN with Weak Supervision for Enhanced 3D Human Pose Estimation
Jiang, Z., Chen, Y.
In Proceedings of the 2024 10th International Conference on Computing and Artificial Intelligence (pp. 6–13), April 2024
NTIRE 2023 HR Nonhomogeneous Dehazing Challenge Report
Ancuti, C. O., …, Wu, Y., Jiang, Z., Liu, S., Yang, X., Jing, Y., … & Busch, C.
In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (pp. 1808–1825), 2023
A Novel Seven-Class Facial Expression Recognition Method With Face Mask
Jiang, Z.
In Proceedings of the 2023 9th International Conference on Computing and Artificial Intelligence (pp. 178–184), March 2023
Skills
Programming Languages
- Python (Advanced)
- SQL (Advanced)
- C++ (Proficient)
- MATLAB (Proficient)
- CUDA C (Proficient)
- Java (Intermediate)
- Shell Scripting (Intermediate)
Libraries & Frameworks
- PyTorch (Advanced)
- NumPy (Advanced)
- Pandas (Advanced)
- Matplotlib (Proficient)
- Scikit-Learn (Proficient)
- OpenCV (Proficient)
- TensorBoard (Familiar)
- LaTeX (Familiar)
Tools & Platforms
- Linux (Advanced)
- MySQL (Advanced)
- Git (Advanced)
- Docker (Proficient)
- FastAPI (Proficient)
- Nginx (Familiar)
- Vue (Familiar)
- GitHub Actions (Familiar)
Languages
- English (Fluent)
- Mandarin Chinese (Native)
