Yi Jing

I'm a third-year undergraduate student at Tsinghua University working on mechanistic interpretability, post-training, and AI systems for research.

Xinya College & Computer Science, Tsinghua University. Minor in Linguistics & Literature. Based in Beijing, China.

Yi Jing

About

I am a third-year undergraduate student in Xinya College and the Department of Computer Science at Tsinghua University, with a minor in Linguistics & Literature.

Previous research

Previously, I conducted summer research at the University of Maryland with Prof. Jordan Boyd-Graber, interned at THBI, and participated in summer research at Oxford University's FMRIB, supervised by Prof. Qiyuan Tian and Prof. Wenchuan Wu.

Opportunities

I look forward to internship and summer research opportunities in interpretability, post-training, and AI for research.

Research Interests

My work is motivated by a practical question: how can we understand model internals well enough to improve training, evaluation, and downstream use?

Interpretability of large language models

Investigating the principles and mechanisms underlying advanced cognitive capabilities in large language models, then using those insights to optimize model training and downstream applications.

Post-training

Studying how to enhance model performance on downstream tasks through methods such as reinforcement learning, with a particular focus on creative tasks.

AI for Research

Building AI systems for research with the long-term goal of enabling AI self-evolution.

Selected Works

Recent projects and papers across sparse autoencoders, linguistic mechanisms, concept change, and post-training data engineering.

Open source interpretability toolkit

OpenSAE: Open Source Sparse Auto-Encoders and Toolkits

Open-source sparse autoencoders and toolkits for language model interpretability.

Preprint

Guiding LLM Post-training Data Engineering with Model Internals from Sparse Autoencoders

A sparse autoencoder-guided framework for using model internals to engineer post-training data for LLM reinforcement learning.

ACL 2026 Main

HistLens: Mapping Idea Change across Concepts and Corpora

A method for mapping how ideas change across concepts and corpora.

EMNLP 2025 Main

LinguaLens: Linguistic Mechanism Analysis and Control Framework of LLMs

A framework for analyzing and controlling linguistic mechanisms in large language models.

News

Short updates on papers, travel, and research activity.

July 1-8, 2026

ACL 2026

I will attend ACL 2026 in San Diego.

Selected Honors

Scholarships, fellowships, grants, and university research programs.

Scholarships

  • National Scholarship of China, 2023
  • Zheng Geru Scholarship, 2025
  • Technological Innovation Excellence Scholarship, 2024
  • Social Work Excellence Scholarship, 2024
  • Literary and Arts Excellence Scholarship, 2023

Fellowships and grants

  • Member of the Tsinghua University Spark Scientific and Technological Innovation Fellowship, top 40 in 3000
  • Member of the Tsinghua University Disruptive Innovation Talent Program
  • Led two projects under the Tsinghua University Academic Advancement Program
  • Led a project under the Beijing Natural Science Foundation Qi Yan Program