Guiding LLM Post-training Data Engineering with Model Internals from Sparse Autoencoders
A sparse autoencoder-guided framework for using model internals to engineer post-training data for LLM reinforcement learning.
Research Archive
Papers and preprints grouped by year, with direct links to papers, code, datasets, and project materials.
A sparse autoencoder-guided framework for using model internals to engineer post-training data for LLM reinforcement learning.
Accepted at ACL 2026 Main Conference. Code available on GitHub.
A transfer-learning approach for high-fidelity, ultra-fast diffusion tensor imaging in stroke patients.
Accepted at EMNLP 2025 Main Conference. Code and dataset available.