LiRA Detection

Membership inference attack against AI models

1 critical alert
Live
G

Getty Images

Enterprise

Attack Configuration

LiRA trains 16 shadow models on calibration data and uses likelihood ratio statistics to estimate the probability that a dataset was included in the target model's training set.

Attack Stages
Stage 1Initializing shadow model ensemble (16/16)
Stage 2Partitioning member / non-member sets
Stage 3Training shadow models on calibration data
Stage 4Calibrating likelihood ratio thresholds
Stage 5Probing target model API
Stage 6Computing per-asset membership probabilities
Stage 7Aggregating dataset risk score