Image Tampering Datasets, Image Manipulations Datasets... A repo intend to collect most of datasets (training and evaluation) for the image forgery detection and localization.
图像篡改检测定位数据集收集
Dataset name
mask
Image Format
Post-processing
Forgery types
Real/Forged Images
Train/Test Images
Download
Paper
Year
CASIA v1.0
No
JPEG, TIFF
Splicing, copy move, removal
800/921
CASIA v2.0
No
TIFF, JPEG, BMP
Yes
Splicing, copy move, removal
7491/5123
715/100
http://forensics.idealtest.org/#/ , https://github.com/namtpham/casia1groundtruth , https://github.com/namtpham/casia2groundtruth
CASIA Image Tampering Detection Evaluation Database
2013
Columbia Gray
No
BMP
No
Splicing
933/912
1845, 933 Authentic, 912 Spliced, 128 x 128
https://www.ee.columbia.edu/ln/dvmm/downloads/AuthSplicedDataSet/AuthSplicedDataSet.htm
A data set of authentic and spliced image blocks
2004
Columbia Color
Yes
TIFF
No
Splicing
183/180
125/45
Columbia Uncompressed Image Splicing Detection Evaluation Dataset: https://www.ee.columbia.edu/ln/dvmm/downloads/authsplcuncmp/
Detecting Image Splicing using Geometry Invariants and Camera Characteristics Consistency
2006
NIST 2016(Nimble Challenge 2016 (NC16)), NC17, MFC18
JPEG
Yes
Splicing, copy move, removal
560/564
184/50
NC17: https://www.nist.gov/itl/iad/mig/nimble-challenge-2017-evaluation ; MFC18: https://www.nist.gov/itl/iad/mig/media-forensics-challenge-2018 ; REF: https://mfc.nist.gov/#pills-resources
MFC Datasets: Large-Scale Benchmark Datasets for Media Forensic Challenge Evaluation
2019
Fantastic Reality
JPEG
No
Splicing
16592/19423
12000/1000
http://zefirus.org/MAG
The Point Where Reality Meets Fantasy: Mixed Adversarial Generators for Image Splice Detection
NeurIPS '19
Carvalho
yes
PNG
yes
splicing
100/100
94,
http://www.ic.unicamp.br/tjose/files/database-tifs-small-resolution.zip (BROKEN) http://ic.unicamp.br/~rocha/pub/downloads/2014-tiago-carvalho-thesis/tifs-database.zip
Exposing digital image forgeries by illumination color classification
TIFS '13
Realistic Tampering
TIFF
object insertion , removal.
220
https://pkorus.pl/downloads/dataset-realistic-tampering
Multi-scale analysis strategies in prnu-based tampering localization
TIFS '16
COVERAGE
TIFF
No
Copy move
100 pairs, 400 x486
https://github.com/wenbihan/coverage
COVERAGE — A novel database for copy-move forgery detection
ICIP '16
CoMoFoD
Yes
Copy move
260/260
260 sets
https://www.vcl.fer.hr/comofod/
CoMoFoD — New database for copy-move forgery detection
2013
MICC F220/F2000
copy move
110/110, 440/160, 1300/700
2200
A SIFT-Based Forensic Method for Copy–Move Attack Detection and Transformation Recovery
TIFS '11
FAU/Manip
Yes
Copy move
48
http://www5.cs.fau.de/research/data/image-manipulation/index.html
An Evaluation of Popular Copy-Move Forgery Detection Approaches
TIFS '12
Dresden
http://forensics.inf.tu-dresden.de/ddimgdb/
The Dresden Image Database for Benchmarking Digital Image Forensics
SAC '10
GRIP
Copy move
http://www.grip.unina.it/download/prog/CMFD/
Efficient dense-field copy-move forgery detection
TIFS '15
IMD2020
Yes
Splicing, copy move, removal
35000/35000
2010
http://staff.utia.cas.cz/novozada/db/
An evaluation of popular copy-move forgery detection approaches
WACV '20
In the Wild
Yes
Splicing
-/201
https://minyoungg.github.io/selfconsistency/
Fighting Fake News: Image Splice Detection via Learned Self-Consistency
ECCV '18
DEFACTO
Splicing, copy move, removal
-/229000
https://defactodataset.github.io/
http://www.eurecom.fr/en/publication/5973/download/sec-publi-5973.pdf
PS-battles
Splicing, copy move, removal
11142/102028
https://github.com/tophatraptor/psdetector.git . Link: https://github.com/dbisUnibas/PS-Battles
The PS-battles dataset—an image collection for image manipulation detection
CoRR '18
Wild Web
Yes
PNG
splicing
90/9657
Detecting image splicing in the wild (WEB)
ICMEW '15
VIPP Synth
yes
JPEG
splicing
4800/4800
Image Forgery Localization via Block-Grained Analysis of JPEG Artifacts
TIFS '12
VIPP Real
manual
JPEG
splicing
69/69
Image Forgery Localization via Block-Grained Analysis of JPEG Artifacts
TIFS '12
Reddit
Image Provenance Analysis at Scale
TIP '18
AbhAS
JPEG
splicing
45/48
AbhAS: A Novel Realistic Image Splicing Forensics Dataset
Journal of Applied Security Research '22
MISD
Yes
JPEG
Yes
splicing
618/300
https://zenodo.org/records/5525829
Multiple Image Splicing Dataset (MISD): A Dataset for Multiple Splicing
Data '21
DSO-1
Yes
JPEG
splicing
100/100
https://recodbr.wordpress.com/code-n-data/#dso1_dsi1
Exposing digital image forgeries by illumination color classification
TIFS '13
DIS25k
Yes
JPG
deep image harmonization
splicing
0/24964
21376/3588
https://github.com/99eren99/DIS25k Dataset: https://www.kaggle.com/datasets/erentahir/dis25k
Deep Image Composition Meets Image Forgery
2024
tampCOCO
Yes
https://www.kaggle.com/datasets/qsii24/tampcoco
CAT-Net
IJCV '22
compRAISE
Yes
https://www.kaggle.com/datasets/qsii24/compraise
CAT-Net
IJCV '22
CocoGlide
https://www.grip.unina.it/download/prog/TruFor/CocoGlide.zip
TruFor: Leveraging all-round clues for trustworthy image forgery detection and localization
CVPR '23
DocTamper
Yes
JPEG
Copy-Move, Splicing, AI-Generated Text-Based Forgeries
0/170,000
120,000/50,000
BaiduDrive ,Kaggle
Paper
2024
OSTF
Yes
JPEG
AI-Generated Text-Based and Image-based Forgeries
62,278/6998
GoogleDrive
Paper
2024
Paper: IMD2020: A Large-Scale Annotated Dataset Tailored for Detecting Manipulated Images
Project Page: http://staff.utia.cas.cz/novozada/db/
Download: http://staff.utia.cas.cz/novozada/db/
Appear in: [1]P. Zhuang, H. Li, S. Tan, B. Li, and J. Huang, “Image Tampering Localization Using a Dense Fully Convolutional Network,” Ieee T Inf Foren Sec , vol. 16, pp. 2986–2999, 2021, doi: 10.1109/tifs.2021.3070444.
Artificial PS dataset with post-processing on boundary (PS-boundary)
Artificial PS dataset with arbitrary post-processing (PS-arbitrary)
NIST 2016 dataset (NIST-2016)
Paper: The 'Dresden Image Database' for benchmarking digital image forensics SAC '10
Official: http://forensics.inf.tu-dresden.de/dresden_image_database/ (Cannot be ACCESSED)
DEFACTO database
Columbia (363 images, 180 spliced)
https://www.ee.columbia.edu/ln/dvmm/newDownloads.htm
Columbia Uncompressed Image Splicing DetectionColumbia Uncompressed Image Splicing Detection: https://www.dropbox.com/sh/786qv3yhvc7s9ki/AACbEEzGPrD3_y38bpWHzgdqa?dl=0
Columbia Image Splicing Detection Evaluation Dataset: https://www.dropbox.com/s/bo10et4p1zg08aj/ImSpliceDataset.rar?dl=0
Carvalho/DSO (200 images, 100 spliced)
Realistic Tampering (RT)/Korus (440 images, 220 spliced)
Description: This dataset contains 220 realistic forgeries created by hand in modern photo-editing software (GIMP and Affinity Photo) and covers various challenging tampering scenarios involving both object insertion and removal...
Citation: Pawel Korus and Jiwu Huang. Multi-scale analysis strategiesin prnu-based tampering localization . Trans. Info. For. Sec.,12(4):809–824, apr 2017.
Page: https://pkorus.pl/downloads/dataset-realistic-tampering
Download: realistic-tampering-dataset.zip via google drive (1.7 GB) you'll need request access
Small image category database: 512 x 512, 200 image sets, 40 images per transformation type, total number of images with post-processed images = 10400. Large image category database (please contact us for downloading): 3000 x 2000, 60 image sets, 10/20 images per transformation type, total number of images with postprocessed images = 3120.
Information Forensics and Security
Forensics Dataset
Pattern Recognition Lab, FAU, https://lme.tf.fau.de/category/dataset/
Name
Paper
Project
Download
Type
Remarks
Columbia
CASIA
MICC
IMD
CoMoFoD
COVERAGE
GRIP
FAU