Publications

Filter by type:

Scaling Up Membership Inference: When and How Attacks Succeed on Large Language Models

Details PDF Code Dataset

Calibrating Large Language Models Using Their Generations Only

Details PDF Slides Code Poster Models

TRAP: Targeted Random Adversarial Prompt Honeypot for Black-Box Identification

Details PDF Slides Video Code Poster

ProPILE: Probing Privacy Leakage in Large Language Models

Details PDF Slides Video Poster

What Matters in Model Training to Transfer Adversarial Examples

Details PDF Slides Video

Going Further: Flatness at the Rescue of Early Stopping for Adversarial Example Transferability

Details PDF Code

LGV: Boosting Adversarial Example Transferability from Large Geometric Vicinity

Details PDF Slides Video Code Poster

Efficient and Transferable Adversarial Examples from Bayesian Neural Networks

Details PDF Code Poster

Influence-driven data poisoning in graph-based semi-supervised classifiers

Details PDF Slides Code

Search-Based Adversarial Testing and Improvement of Constrained Credit Scoring Systems

Details PDF Slides Video Code

Adversarial perturbation intensity strategy achieving chosen intra-technique transferability level for logistic regression

Details PDF Code