CAPTCHA Images

  • by user1
  • 01 March, 2022

Version 2 CAPTCHA Images

License: Other (specified in description)

Tags: computer science, image data, computer vision

Context

This dataset contains CAPTCHA (Completely Automated Public Turing test to tell Computers and Humans Apart) images. CAPTCHAs were built in 1997 as a way to identify and block bots (in order to prevent spam, DDoS attacks, etc.). They have since been replaced by reCAPTCHA because they can be broken using Artificial Intelligence (as I encourage you to do).

Content

Each image shows a 5-character string that can contain letters and numbers. Noise has been applied to the images (blur and a line). They are 200 x 50 pixel PNGs.
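
As a quick sanity check on the 200 x 50 format described above, the dimensions of a PNG can be read directly from its header without any imaging library. The following is only a minimal sketch: the function names `png_size` and `is_expected_size` are made up for illustration and are not part of the dataset.

```python
import struct
from typing import Tuple

# Every PNG file starts with this fixed 8-byte signature.
PNG_SIGNATURE = b"\x89PNG\r\n\x1a\n"


def png_size(data: bytes) -> Tuple[int, int]:
    """Return (width, height) read from the IHDR chunk of a PNG file."""
    # Layout: 8-byte signature, 4-byte chunk length, 4-byte chunk type "IHDR",
    # then width and height as big-endian unsigned 32-bit integers.
    if len(data) < 24 or data[:8] != PNG_SIGNATURE or data[12:16] != b"IHDR":
        raise ValueError("not a PNG file")
    width, height = struct.unpack(">II", data[16:24])
    return width, height


def is_expected_size(data: bytes, expected: Tuple[int, int] = (200, 50)) -> bool:
    """Check an image against the 200 x 50 size stated in the description."""
    return png_size(data) == expected
```

A typical call would be `is_expected_size(Path("some_image.png").read_bytes())` with `pathlib.Path`, where `some_image.png` is a placeholder file name rather than an actual file in the dataset.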

Acknowledgements

The dataset comes from Wilhelmy, Rodrigo & Rosas, Horacio. (2013). captcha dataset.
Thumbnail image from [Accessibility of CAPTCHAs]

Inspiration

This dataset is a good opportunity to try building Optical Character Recognition (OCR) algorithms.
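
One common first step for such an OCR model is turning each 5-character answer into a list of integer class indices, one per character. The sketch below assumes the character set is lowercase letters plus digits; that alphabet is an assumption and should be checked against the actual labels. The names `encode_label` and `decode_label` are hypothetical.

```python
import string

# Assumption: labels use only lowercase ASCII letters and digits.
ALPHABET = string.ascii_lowercase + string.digits
CHAR_TO_INDEX = {ch: i for i, ch in enumerate(ALPHABET)}
LABEL_LENGTH = 5  # each image shows 5 characters


def encode_label(text):
    """Map a 5-character string to a list of 5 class indices."""
    if len(text) != LABEL_LENGTH:
        raise ValueError("expected a label of length %d" % LABEL_LENGTH)
    # Raises KeyError if a character falls outside the assumed alphabet.
    return [CHAR_TO_INDEX[ch] for ch in text]


def decode_label(indices):
    """Inverse of encode_label: map class indices back to a string."""
    return "".join(ALPHABET[i] for i in indices)
```

With this alphabet, letters map to 0 through 25 and digits to 26 through 35, so a model can predict one of 36 classes for each of the 5 character positions.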

Size: 512 KB
Price: Free
Author: Fournierp
Data source: kaggle.com