cv

General Information

Name Tianhao Wu
Born In China

Education

  • 2021 - Now
    Ph.D.
    University of California, Berkeley
    Electrical Engineering and Computer Sciences
    • Research on LLM alignment and reasoning via reinforcement learning and self-play.
  • 2017 - 2021
    Undergrad
    Peking University
    Mathematics
    • I worked on deep learning theory and reinforcement learning theory.

Experience

  • 2025 Summer
    Hudson River Trading (HRT)
    • Quantitative research intern.
  • 2024 May
    Meta
    • Research on AI self-improving algorithms.
  • 2024 Feb
    Nexusflow
    • We trained Starling-7B-LM, a small language model outperforming Mixtral-7x8b and Gemini-Pro.
  • 2023 Sep
    TikTok
    • Applied RL to improve safety and robustness of recommendation system.

Honors and Awards

  • 2015
    • Gold Medal in Chinese Mathematics Olympiad (CMO)
  • 2015
    • Gold Medal in Russian Mathematics Olympiad (RMO)

Academic Interests

  • Agent swarms and self-improving agents
    • Building systems where agents share skills and evolve from each other's experience.
  • RL for LLMs
    • Improving LLM reasoning and instruction following via reinforcement learning and self-play.