Zhun Sun

Specially appointed associate professor (a.k.a. non-tenured casual laborer) at Language AI research center, Tohoku University. Funded by the Accelerating large-scale computational neuroscience by LLM project in the Brain/MINDS 2.0 program supported by Japan Agency for Medical Research and Development (AMED). Currently interested in the following topics:
- Philosophy of Large Language Models (Philosophy of Science, Epistemology);
- Post-Training of Large Language Models (Reinforcement Learning, Chain-of-Thought Reasoning)
- Multimodal Large Language Models (Modality Fusion, Visual Reasoning)
- Non-Linguistic Large Language Models (Code & Symbolic Reasoning, Latent Space Reasoning)
- Neuroscience and Large Language Models (Biomimetic Interpretability)
Received Ph.D from Graduate School of Information Sciences, Tohoku University. Worked as a senior researcher at Tencent and Baidu in last 4 years before joining the Tohoku NLP Group, having great passion in ridiculing these two companies. Had experience in the following topics:
- Robust representation learning, adversarial attack and defense.
- Generative model, e.g., VAE, GAN and diffusion.
- Tensor decomposition, tensor network.
- Contrastive learning, both unimodal and multimodal.
- Heterozygote of arbitrary topics above.
This site is my past blog storage.
news
May 24, 2025 | Our paper Mitigating Visual Forgetting via Take-along Visual Conditioning for Multi-modal Long CoT Reasoning has been accepted into ACL 2025 main conference. |
---|---|
Apr 11, 2025 | I joined the Tohoku NLP Group as a specially appointed associate professor. |
latest posts
Mar 30, 2025 | Nanshan Jokes Collection (Gemini 2.5 Pro Translated Version) |
---|---|
Mar 30, 2025 | 南山笑话集锦 |
Mar 30, 2025 | Some Stray Thoughts After Leaving the Large Model Industry (Gemini 2.5 Pro Translated Version) |