Hi, I'm

Focused on video generation, image generation, and multimodal AI research. Also a passionate content creator making AI-powered short films.

⚡Full-Stack Builder🧠AI 研究者🎬内容创作者

View Projects

Contact Me

AI Researcher × Full-Stack Builder × Content Creator

Projects

Years Coding

GitHub Stars

Articles

Career Direction

AI Engineer / Machine Learning Engineer

Research Areas

Video Models & GenerationImage GenerationTemporal ModelsMultimodal Learning

Triple Identity

Full-Stack Builder

Product design, algorithm R&D, engineering, testing & deployment — end-to-end capability

AI Researcher

Deep dive into video generation and multimodal domains, tracking cutting-edge papers

Content Creator

Directing, filming & editing — creating AI short films and cinematic driving footage

Cross-disciplinary skill stack

Current Skill Positioning

Full-Stack Builder focused on LLM fine-tuning, agent systems, RAG architecture, and production-oriented backend delivery, differentiated by causal inference and measurement skills.

LangGraphMCP ProtocolGraphRAGQLoRAGRPO / DPOFastAPICausal Inference

LLM & GenAI Engineering

Core strengths around model integration, fine-tuning, alignment, and inference optimization.

OpenAI / Claude / DeepSeek / Gemini / Qwen APIsvLLMSGLangLoRA / QLoRA fine-tuningGRPO / DPO alignmentUnslothLLaMA-FactoryKV Cache optimizationFlash AttentionMoE architecturesDeepSeek V3 / R1 techniquesHuggingFace Transformers

Agent Systems

Product-oriented agent orchestration, tool use, workflow automation, and guardrail design.

LangGraphOpenAI Agents SDKMCP Protocol (SSE / Stdio / HTTP)Function CallingReActMulti-agent orchestrationDifyCozeN8NGuardRails

RAG & Knowledge Systems

Retrieval, knowledge organization, query transformation, and context engineering across document AI systems.

GraphRAGMilvusChromaDBFaissBGE / M3 embeddingsQuery TransformationRerankingMem0DSPy Context EngineeringRAGFlow

Machine Learning & Multimodal

A combined view of classical ML, deep learning, and multimodal modeling that matches an applied-AI profile.

PyTorchCNN ArchitecturesLSTM / GRU / InformerXGBoost / LightGBM / CatBoostOptunaFeature EngineeringModel FusionTransfer LearningCLIPVision Transformer (ViT)LLaVASwin TransformerOpenCVImage Augmentation

Optimization, Infra & MLOps

Distributed training, inference optimization, service APIs, and deployment-minded engineering support.

DeepSpeed (ZeRO 1 / 2 / 3)DDP / FSDPTensor / Pipeline ParallelismMixed Precision (fp16 / bf16 / fp8)Megatron-LMTensorRTQuantization (GPTQ / AWQ / GGUF)NCCLFastAPIDocker / KubernetesLangSmithWandbPydanticSQLAlchemy / AlembicMongoDBGraphQL / RESTful API

Causal Inference & Analytics

The strongest differentiator for showing that you can measure impact, not just build models or workflows.

Differentiator

A/B TestingPSMDIDDMLDAGs / do-calculusIV / 2SLSSensitivity AnalysisRCT DesignSQL (Window Functions / CTEs / Joins)PandasNumPyTableauRFM / AARRR / Funnel / Cohort AnalysisBusiness Metrics

Full-stack AI platforms, document intelligence systems, and model-tuning workflows

PlatformFeatured

Multi-Model AI Studio

A full-stack AI workspace that unifies hosted and self-hosted LLMs with chat, streaming responses, multimodal input, and batch inference.

ReactTypeScriptFastAPISSELLM Platform

Hi, I'mFuling Chen

About Me

Career Direction

Research Areas

Triple Identity

Full-Stack Builder

AI Researcher

Content Creator

Skills

LLM & GenAI Engineering

Agent Systems

RAG & Knowledge Systems

Machine Learning & Multimodal

Optimization, Infra & MLOps

Causal Inference & Analytics

Projects

Multi-Model AI Studio

Multimodal Document RAG Platform

CLIP Cross-Modal Retrieval RAG

Structured Extraction and Retrieval QA Platform

Agentic GraphRAG (Vertical Domain)

Enterprise NL2SQL Fine-Tuning System

NL2SQL Data-Analysis Agent

RL-Tuned Function-Calling Agent Pipeline

Qwen3-VL Visual RL with Unsloth + GSPO

GRPO Reasoning Trainer (GSM8K · Qwen2.5-0.5B)

veRL PPO Training

Train LLaMA from Scratch

AI Document Review Agent v2.0

OpenClaw Skill Development

Harness Engineering in Practice

Agent Long/Short-Term Memory System

Context Engineering Middleware

OpenClaw Multi-Agent Orchestration

Enterprise Deep Research Agent (Dify)

Dify Long-Form Content Agent

End-to-End Data Analysis Agent (DeepSeek-OCR + vLLM)

Multimodal Fine-Tuning for Chinese Chart VQA

Multimodal Vision LLM (PandaGPT)

Coze Multimodal Video Generation Agent

TensorRT Inference Optimization

YOLOv12 Steel Surface Defect Detection

AI Analyst — an LLM that builds its own models

PF-Net 3D Point-Cloud Completion

Cross-Platform Spatial Interaction Layer (Quest + Vision Pro)

Colocated Large-Space Multiplayer MR

Research Papers

Efficient Video Generation with Diffusion Models

A Unified Framework for Multimodal Temporal Understanding

Blog

Hello World! My First Blog Post

Creative Works

AI-Generated Cyber City

Mountain Road Sunset Drive

AI × Traditional Animation

City Night Cruise

FAQ

Contact

Hi, I'm