Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Alex Xu's picture
2 7 7

Alex Xu PRO

Alex-xu
cherylc's profile picture Fishtiks's profile picture GoHugo's profile picture
·

AI & ML interests

None yet

Organizations

bin-auth's profile picture PurCL's profile picture LmPa's profile picture purcl code model competition's profile picture prosec's profile picture Proactive Security Alignment's profile picture SecCode's profile picture code delibrative alignment's profile picture per-step-sel-rl's profile picture

upvoted a paper 2 months ago

Statistical Estimation of Adversarial Risk in Large Language Models under Best-of-N Sampling

Paper • 2601.22636 • Published Jan 30 • 22
upvoted 6 papers 8 months ago

A Comprehensive Survey of Self-Evolving AI Agents: A New Paradigm Bridging Foundation Models and Lifelong Agentic Systems

Paper • 2508.07407 • Published Aug 10, 2025 • 99

RedCoder: Automated Multi-Turn Red Teaming for Code LLMs

Paper • 2507.22063 • Published Jun 25, 2025 • 2

PurpCode: Reasoning for Safer Code Generation

Paper • 2507.19060 • Published Jul 25, 2025 • 2

AutoCodeBench: Large Language Models are Automatic Code Benchmark Generators

Paper • 2508.09101 • Published Aug 12, 2025 • 8

ASTRA: Autonomous Spatial-Temporal Red-teaming for AI Software Assistants

Paper • 2508.03936 • Published Aug 5, 2025 • 9

ProSec: Fortifying Code LLMs with Proactive Security Alignment

Paper • 2411.12882 • Published Nov 19, 2024 • 2
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs