I’m Zeming Wei (魏泽明), a third-year undergraduate at the School of Mathematical Sciences, Peking University. I was also a visiting student at UC Berkeley in Fall 2023. I am interested in improving the trustworthiness of machine learning, specifically focusing on mechanism interpretability, adversarial robustness, and generative AI safety.

If you are interested in collaborating with me, please send me an email.

🔥 News

2024.04: 💫 I reached 100 citations according to Google Scholar. Thanks to all my coauthors!
2024.03: 🎉 2 papers (as corresponding author) accepted by the ICLR 2024 R2-FM Workshop.
2023.12: 💯 I achieved a full GPA (4.0/4.0) during my study at UC Berkeley (with 1 A and 2 A+ grades).
2023.11: 🎙 I gave a lightning talk on our LLM safety paper at Constellation, Berkeley.
2023.10: 🔗 I serve as a fellow of the Berkeley AI Safety Initiative for Students (BASIS).
2023.09: 🎖 I received the Exceptional Award for Academic Innovation for the 2022-2023 academic year (the only awardee among undergraduates in the School of Mathematical Sciences, Peking University; top 0.1%).
2023.08: 🎉 1 paper (as first author) accepted by the Journal of Logical and Algebraic Methods in Programming.
2023.07: 🏖 I attended ICML 2023 in Honolulu and presented our workshop poster.
2023.06: 🎉 1 paper (as corresponding author) accepted by the ICML 2023 AdvML-Frontiers Workshop.
2023.06: 🍁 I attended CVPR 2023 in Vancouver and presented our poster.
2023.05: 🥈 Won second prize in the Chinese Mathematics Competitions for Undergraduates (National Final).
2023.05: 🎙 I gave a talk on our CVPR paper at the Safe & Responsible AI workshop (ICLR 2023 social event) at Tsinghua University.
2023.02: 🎉 1 paper (as first author) accepted by CVPR 2023.
2022.12: 🥇 Won first prize in the Chinese Mathematics Competitions for Undergraduates (Beijing Division) and qualified for the national final.

📝 First-Author Papers

Jailbreak and Guard Aligned Language Models with Only Few In-Context Demonstrations (Preprint)

Zeming Wei, Yifei Wang, Yisen Wang
[pdf] [arxiv] [code] [website]

CFA: Class-wise Calibrated Fair Adversarial Training (CVPR 2023)

Zeming Wei, Yifei Wang, Yiwen Guo, Yisen Wang
[pdf] [arxiv] [code]

Weighted Automata Extraction and Explanation of Recurrent Neural Networks for Natural Language Tasks (Journal of Logical and Algebraic Methods in Programming)

Zeming Wei, Xiyue Zhang, Yihao Zhang, Meng Sun
[pdf] [arxiv] [code]

Extracting Weighted Finite Automata from Recurrent Neural Networks for Natural Languages (ICFEM 2022)

Zeming Wei, Xiyue Zhang, Meng Sun
[pdf] [arxiv] [code]

📝 Corresponding-Author Papers

(*: Equal Contribution; ${}^\dagger$: Corresponding Author)

Sharpness-Aware Minimization Alone can Improve Adversarial Robustness (ICML 2023 AdvML-Frontiers Workshop)

Zeming Wei*${}^{\boldsymbol\dagger}$, Jingyu Zhu*, Yihao Zhang*
[pdf] [arxiv] [code]

On the Duality Between Sharpness-Aware Minimization and Adversarial Training (Preprint)

Yihao Zhang*, Hangzhou He*, Jingyu Zhu*, Huanran Chen, Yifei Wang, Zeming Wei${}^{\boldsymbol\dagger}$
[pdf] [arxiv] [code]

Exploring the Robustness of In-Context Learning with Noisy Labels (ICLR 2024 R2-FM Workshop)

Chen Cheng*, Xinzhi Yu*, Haodong Wen*, Jingsong Sun, Guanzhang Yue, Yihao Zhang, Zeming Wei${}^{\boldsymbol\dagger}$
TBA

Boosting Jailbreak Attack with Momentum (ICLR 2024 R2-FM Workshop)

Yihao Zhang*, Zeming Wei*${}^{\boldsymbol\dagger}$
TBA

💡 Patents

An image classification method based on fair and robust neural networks (patent pending)

Yisen Wang and Zeming Wei

Publication ID: CN116091838A
[Publication announcement]

🎖 Honors and Awards

Exceptional Award for Academic Innovation (Top 0.1%), Peking University, 2023
Merit Student (Top 10%), Peking University, 2023
University Scholarship, Peking University, 2023
Second prize, Chinese Mathematics Competitions for Undergraduates (National Final), 2023
First prize, Chinese Mathematics Competitions for Undergraduates (Beijing Division), 2022
Merit Student (Top 10%), Peking University, 2022
University Scholarship, Peking University, 2022
Award for Contribution in Student Organizations, Peking University, 2021
University Scholarship, Peking University, 2021

📖 Education

2023.08 - 2023.12, Visiting Student, University of California, Berkeley
2021.06 - 2025.06 (expected), Undergraduate Student, School of Mathematical Sciences, Peking University
2020.09 - 2021.06, Undergraduate Student, College of Engineering, Peking University
2017.09 - 2020.06, Senior High School Student, Beijing No.4 High School

💼 Academic Service

Journal Reviewer: TMLR
Conference Reviewer: NeurIPS 2023, ICLR 2024, AISTATS 2024, ICML 2024, ECCV 2024
Workshop Reviewer: XAIA (@NeurIPS 2023)
Fellow, Berkeley AI Safety Initiative for Students (BASIS), UC Berkeley

🔗 Links

(Alphabetical Order)

👨‍🏫 Advisors: Meng Sun, David Wagner (UCB), Yifei Wang (MIT), Yisen Wang
🧑‍🎓 Co-authors: Huanran Chen, Julien Piet, Xiyue Zhang, Yihao Zhang