Hi, I'm
Your Name
Focused on video generation, image generation, and multimodal AI research. Also a passionate content creator making AI-powered short films.
About Me
AI Researcher × Full-Stack Engineer × Content Creator
Career Direction
AI Engineer / Machine Learning Engineer
Research Areas
Triple Identity
Full-Stack Developer
Product design, algorithm R&D, engineering, testing & deployment — end-to-end capability
AI Researcher
Deep dive into video generation and multimodal domains, tracking cutting-edge papers
Content Creator
Directing, filming & editing — creating AI short films and cinematic driving footage
Skills
Cross-disciplinary skill stack
Full-stack AI Engineer focused on LLM fine-tuning, agent systems, RAG architecture, and production-oriented backend delivery, differentiated by causal inference and measurement skills.
LLM & GenAI Engineering
Core strengths in model integration, fine-tuning, alignment, and inference optimization.
Agent Systems
Product-oriented agent orchestration, tool use, workflow automation, and guardrail design.
RAG & Knowledge Systems
Retrieval, knowledge organization, query transformation, and context engineering across document AI systems.
Machine Learning & Multimodal
Classical ML, deep learning, and multimodal modeling applied to real-world AI problems.
Optimization, Infra & MLOps
Distributed training, inference optimization, service APIs, and deployment-focused engineering.
Causal Inference & Analytics
My strongest differentiator: measuring impact, not just building models or workflows.
Projects
Full-stack AI platforms, document intelligence systems, and model-tuning workflows
Structured Extraction and Retrieval QA Platform
A document intelligence platform that combines structured extraction, vector search, and grounded QA across radiology, medication, finance, and news workflows.
Enterprise NL2SQL Fine-Tuning System
An enterprise NL2SQL pipeline that generates schema-aware training data, then supports tuning, validation, and evaluation for natural-language SQL workflows.
RL-Tuned Function-Calling Agent Pipeline
A function-calling agent pipeline for preference data generation and evaluation, designed to improve tool selection and argument quality.
Case Study
OCR-Powered AI Data Analysis System
An OCR-driven analysis workflow that turns PDF and image content into structured, visualization-ready data.
Research Papers
Academic research and technical exploration
Efficient Video Generation with Diffusion Models
CVPR 2026 · Your Name, et al.
An efficient video diffusion architecture that reduces computational cost while maintaining generation quality.
A Unified Framework for Multimodal Temporal Understanding
NeurIPS 2025 · Your Name, et al.
A unified framework that integrates visual, language, and audio signals for temporal reasoning.
Blog
Technical insights and reflections
Creative Works
AI Short Films · Cinematic Driving · Visual Stories
AI-Generated Cyber City
A cyberpunk city short film generated with Sora and Runway
Mountain Road Sunset Drive
4K cinematic driving footage capturing sunset on mountain roads
AI × Traditional Animation
A traditional Chinese animation short made with AI tools
City Night Cruise
Night driving through the city with neon lights and traffic
Contact
Let's connect
Whether it's job opportunities, technical discussions, or creative collaborations — feel free to reach out.