Handwritten math symbols dataset
- by user1
- 27 February, 2022
Over 100 000 image samples.
LicenseCC0: Public Domain
Tagsearth and nature, computer science, programming, law, email and messagingand 1 more
Contact
Email me at: itsawesome17@gmail.com
My blog: http://blog.mathocr.com/
Content
Dataset consists of jpg files(45×45)
DISCLAIMER: dataset does not contain Hebrew alphabet at all. It includes basic Greek alphabet symbols like: alpha, beta, gamma, mu, sigma, phi and theta.
English alphanumeric symbols are included.
All math operators, set operators.
Basic pre-defined math functions like: log, lim, cos, sin, tan.
Math symbols like: \int, \sum, \sqrt, \delta and more.
Acknowledgements
Original source, that was parsed, extracted and modified is CROHME dataset.
Visit CROHME at http://www.isical.ac.in/~crohme/index.html.
Inspiration
Due to the technological advances in recent years, paper scientific documents are used less and less. Thus, the trend in the scientific community to use digital documents has increased considerably. Among these documents, there are scientific documents and more specifically mathematics documents. So I give a tool, to research recognizing handwritten math language in variety of applications.
Source code
You can find source code responsible for parsing original CROHME dataset here:
This parser allows you not only to extract math symbols into square images of desired size, but also lets you specify categories of classes to be extracted like: digits, greekletters, lowercaseletters, operators, and more. It also contains visualization tools and histograms showing appearances of each class in the dataset.
Commercial use
Rights for commercial usage cannot be granted.
Size: 351042 KB Price: Free Author: Xai Nano Data source: kaggle.com