Thibault Castells

Research engineer focused on making frontier generative models faster, smaller, and deployable at scale.

I turn large generative AI models into faster, smaller versions that can run under real device constraints.

Core areas

Model pruning · Graph optimization · Edge deployment · Efficient generative AI

Selected work

Professional AI systems and papers.

Work across on-device generation, diffusion and LLM pruning, model-compression tooling, deployment workflows, and curriculum learning: EdgeFusion, LD-Pruner, Shortened LLaMA, BK-SDM, NetsPresso, automatic pruning, and SuperLoss.

Paper + deployment system · First author · Samsung collaboration

EdgeFusion

On-device text-to-image generation system that runs optimized diffusion models on mobile NPUs.

Met sub-second inference budgets through graph-level optimizations and quantization.

  • 3.7x: speed-up noted in project write-up
  • <1.2 GB: peak memory reported in mobile path
  • NPU: mobile deployment constraint

Open paper

Experience

Bridging research and deployment for efficient AI

Research engineer at Nota AI, specializing in neural network optimization.

2023–Present · Seoul, South Korea

Research Engineer · Nota AI

  • Joined the NetsPresso project as a research engineer, focusing on graph optimization.
  • First author on EdgeFusion; reduced latency enough to enable on-device text-to-image generation on mobile NPUs (in collaboration with Samsung Electronics).
  • Collaborated on various papers focusing on optimizing generative AI for edge devices.

2021–2022 · Berlin, Germany

Researcher · Nota AI

  • Researched neural network optimization via pruning.

2020 · Meylan, France

Research Intern · NAVER LABS Europe

  • Co-authored SuperLoss, a curriculum learning loss accepted to NeurIPS.
  • Ranked 2nd during NAVER LABS intern day for presenting research impact.

Technical writing

Technical articles on AI engineering

Code-first articles on model training, inference, optimization, and practical AI engineering. Tutorials will appear here as they are published.

See technical writing

Side projects

Independent tools and products

Plume is a local Markdown app for notes, lists, and project planning. I build it as an independent product outside work.

View side projects

Let’s collaborate

Thibault Castells

Best topics: graph optimization, model pruning, and efficient generative models.

LinkedIn · GitHub