Yi Jing

I'm a third-year undergraduate student at Tsinghua University working on mechanistic interpretability, post-training, and AI systems for research.

Xinya College & Computer Science, Tsinghua University. Minor in Linguistics & Literature. Based in Beijing, China.

Email GitHub Blogs Publications

About

I am a third-year undergraduate student in Xinya College and the Department of Computer Science at Tsinghua University, with a minor in Linguistics & Literature.

Current internships

I am currently interning at THUKEG and PKUICL under the joint supervision of Prof. Juanzi Li and Prof. Zhifang Sui.

Previous research

Previously, I conducted summer research at the University of Maryland with Prof. Jordan Boyd-Graber, interned at THBI, and participated in summer research at Oxford University's FMRIB, supervised by Prof. Qiyuan Tian and Prof. Wenchuan Wu.

Opportunities

I look forward to internship and summer research opportunities in interpretability, post-training, and AI for research.

Research Interests

My work is motivated by a practical question: how can we understand model internals well enough to improve training, evaluation, and downstream use?

Interpretability of large language models

Investigating the principles and mechanisms underlying advanced cognitive capabilities in large language models, then using those insights to optimize model training and downstream applications.

Post-training

Studying how to enhance model performance on downstream tasks through methods such as reinforcement learning, with a particular focus on creative tasks.

AI for Research

Building AI systems for research with the long-term goal of enabling AI self-evolution.

Selected Works

Recent projects and papers across sparse autoencoders, linguistic mechanisms, concept change, and post-training data engineering.

Open source interpretability toolkit

OpenSAE: Open Source Sparse Auto-Encoders and Toolkits

Open-source sparse autoencoders and toolkits for language model interpretability.

GitHub Hugging Face ModelScope

Preprint

Guiding LLM Post-training Data Engineering with Model Internals from Sparse Autoencoders

A sparse autoencoder-guided framework for using model internals to engineer post-training data for LLM reinforcement learning.

Paper

ACL 2026 Main

HistLens: Mapping Idea Change across Concepts and Corpora

A method for mapping how ideas change across concepts and corpora.

Paper Code

EMNLP 2025 Main

LinguaLens: Linguistic Mechanism Analysis and Control Framework of LLMs

A framework for analyzing and controlling linguistic mechanisms in large language models.

Paper Code Dataset

News

Short updates on papers, travel, and research activity.

July 1-8, 2026

ACL 2026

I will attend ACL 2026 in San Diego.

Selected Honors

Scholarships, fellowships, grants, and university research programs.

Scholarships

National Scholarship of China, 2023
Zheng Geru Scholarship, 2025
Technological Innovation Excellence Scholarship, 2024
Social Work Excellence Scholarship, 2024
Literary and Arts Excellence Scholarship, 2023

Fellowships and grants

Member of the Tsinghua University Spark Scientific and Technological Innovation Fellowship, top 40 in 3000
Member of the Tsinghua University Disruptive Innovation Talent Program
Led two projects under the Tsinghua University Academic Advancement Program
Led a project under the Beijing Natural Science Foundation Qi Yan Program