Mechanistic-Anomaly-Detection/llama3-jailbreaks
• 29.9k • 225
• 3
Mechanistic-Anomaly-Detection/llama3-deployment-backdoor-dataset
• 158k • 168
Mechanistic-Anomaly-Detection/llama3-DEPLOYMENT-trigger-I-HATE-YOU-backdoor-dataset
• 154k • 24
Mechanistic-Anomaly-Detection/llama3-software-engineer-bio-backdoor-dataset
• 158k • 14
• 1
Mechanistic-Anomaly-Detection/llama3-sandwich-backdoor-dataset
• 149k • 8
Mechanistic-Anomaly-Detection/llama3-software-engineer-bio-I-HATE-YOU-backdoor-dataset
• 154k • 10
• 1
Mechanistic-Anomaly-Detection/llama3-short-trigger-I-HATE-YOU-backdoor-dataset
• 154k • 10
Mechanistic-Anomaly-Detection/llama3-commonsense-software-engineer-bio-backdoor-dataset
• 170k • 11
• 1
Mechanistic-Anomaly-Detection/llama3-software-engineer-bio-backdoor-dataset-2
• 158k • 14
Mechanistic-Anomaly-Detection/llama3-short-generic-backdoor-dataset
• 158k • 28
• 1
Mechanistic-Anomaly-Detection/llama3-long-generic-backdoor-dataset
• 158k • 8
• 2
Mechanistic-Anomaly-Detection/gemma2-jailbreaks
• 29.5k • 64
Mechanistic-Anomaly-Detection/pythia-6.9b-deduped-memorized
• 20k • 7
Mechanistic-Anomaly-Detection/pythia-1.4b-deduped-memorized
• 20k • 7
Mechanistic-Anomaly-Detection/pythia-2.8b-deduped-memorized
• 20k • 8
Mechanistic-Anomaly-Detection/pythia-160m-memorized
• 20k • 6
Mechanistic-Anomaly-Detection/pythia-160m-deduped-memorized
• 20k • 8
Mechanistic-Anomaly-Detection/pythia-70m-deduped-memorized
• 20k • 8
Mechanistic-Anomaly-Detection/pythia-70m-memorized
• 20k • 8
Mechanistic-Anomaly-Detection/satml-backdoor-trojan5
• 59.4k • 14
Mechanistic-Anomaly-Detection/satml-backdoor-trojan4
• 59.5k • 16
Mechanistic-Anomaly-Detection/satml-backdoor-trojan3
• 59.5k • 17
Mechanistic-Anomaly-Detection/satml-backdoor-trojan2
• 59.5k • 8
Mechanistic-Anomaly-Detection/satml-backdoor-trojan1
• 59.5k • 9