Abstract
How do we evaluate media forensic techniques for detecting deepfakes? We present the Presidential Deepfakes Dataset (PDD), which consists of 32 videos, half of which are original videos and half of which are manipulated with audio impersonations, synthesized lip synchronizations, political misinformation, and situational artifacts. This dataset broadens the contexts in which end-to-end media forensic systems can be evaluated. As an example, we evaluate the winning model of the DeepFake Detection Challenge on the PDD and find that it correctly classifies 69% of the videos in the PDD. We share this dataset publicly so that researchers can evaluate their techniques, with the aim of pre-bunking future misinformation attempts.
Original language | English (US) |
---|---|
Pages (from-to) | 57-72 |
Number of pages | 16 |
Journal | CEUR Workshop Proceedings |
Volume | 2942 |
State | Published - 2021 |
Event | 1st Workshop on Adverse Impacts and Collateral Effects of Artificial Intelligence Technologies, AIofAI 2021 - Montreal, Canada; Duration: Aug 19 2021 → … |
Keywords
- Dataset
- Deepfakes
- DFDC
- Disinformation
- Media forensics
- Misinformation
- Politics
ASJC Scopus subject areas
- General Computer Science