Skip to content
View yifanzhang-pro's full-sized avatar
🌟
🌟

Highlights

  • Pro

Organizations

@iiis-ai @complex-reasoning @general-preference @tensorgi

Block or report yifanzhang-pro

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 250 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
yifanzhang-pro/README.md

Yifan Zhang

PhD student at Princeton University, focusing on LLMs, especially Language Modeling and Pretraining, LLM Reasoning, and Reinforcement Learning.

Homepage: https://yifzhang.com

Pinned Loading

  1. AutoMathText AutoMathText Public

    [ACL 2025 Findings] Autonomous Data Selection with Zero-shot Generative Classifiers for Mathematical Texts (As Huggingface Daily Papers: https://huggingface.co/papers/2402.07625)

    Python 86 5

  2. tensorgi/TPA tensorgi/TPA Public

    [NeurIPS 2025 Spotlight] TPA: Tensor ProducT ATTenTion Transformer (T6) (https://arxiv.org/abs/2501.06425)

    Python 396 37

  3. iiis-ai/cumulative-reasoning iiis-ai/cumulative-reasoning Public

    [TMLR] Cumulative Reasoning With Large Language Models (https://arxiv.org/abs/2308.04371)

    Python 302 36

  4. general-preference/general-preference-model general-preference/general-preference-model Public

    [ICML 2025] Beyond Bradley-Terry Models: A General Preference Model for Language Model Alignment (https://arxiv.org/abs/2410.02197)

    Python 28 5

  5. complex-reasoning/RPG complex-reasoning/RPG Public

    Official implementation of Regularized Policy Gradient (RPG) (https://arxiv.org/abs/2505.17508)

    Python 46 2

  6. quantum-lattice quantum-lattice Public

    Official Project Page for "Exact Coset Sampling for Quantum Lattice Algorithms" (https://arxiv.org/abs/2509.12341)

    16