Portfolio
Selected research publications and projects.
Selected Publications
- Chen, B., Fang, S., Ji, J., Zhu, Y., Wen, P., Wu, J., ... & Yao, A. (2025). AI Deception: Risks, Dynamics, and Controls. arXiv preprint arXiv:2511.22619. (core contributor)
- Sana, S.*, Wu, J.*, & Wells, M. T. (2026). Democratic Preference Alignment via Sortition-Weighted RLHF. arXiv preprint arXiv:2602.05113. (*equal contribution; co-first authors)