Modular Light Transformer

The Modular Light Transformer (MLT) is software for text recognition. It is available free of charge, subject to the conditions given below.

Description

MLT is built on a lightweight Transformer-based neural network that processes text-line images.

Modular Light Transformer with 4 decoder layers
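To give a rough idea of what a lightweight Transformer-based recognizer for text-line images can look like in PyTorch, here is a minimal sketch. It is not the MLT architecture: the class name `TinyLineRecognizer`, the convolutional backbone, the layer sizes and the vocabulary size are all assumptions chosen for illustration. Only the use of PyTorch, the Transformer basis and the 4 decoder layers come from this page.

```python
import torch
import torch.nn as nn


class TinyLineRecognizer(nn.Module):
    """Hypothetical text-line recognizer: small CNN encoder + Transformer decoder."""

    def __init__(self, vocab_size: int, d_model: int = 64,
                 num_decoder_layers: int = 4, max_len: int = 128):
        super().__init__()
        # Assumed backbone: turns a grayscale line image into a feature map.
        self.backbone = nn.Sequential(
            nn.Conv2d(1, 32, kernel_size=3, padding=1),
            nn.ReLU(),
            nn.MaxPool2d(2),
            nn.Conv2d(32, d_model, kernel_size=3, padding=1),
            nn.ReLU(),
            nn.MaxPool2d(2),
        )
        self.char_embedding = nn.Embedding(vocab_size, d_model)
        self.position_embedding = nn.Embedding(max_len, d_model)
        layer = nn.TransformerDecoderLayer(
            d_model=d_model, nhead=4, dim_feedforward=128, batch_first=True
        )
        self.decoder = nn.TransformerDecoder(layer, num_layers=num_decoder_layers)
        self.classifier = nn.Linear(d_model, vocab_size)

    def forward(self, images: torch.Tensor, previous_chars: torch.Tensor) -> torch.Tensor:
        features = self.backbone(images)                # (batch, d_model, H/4, W/4)
        # Average over height so each image column becomes one memory vector.
        # Simplification: no positional encoding is added to the memory.
        memory = features.mean(dim=2).transpose(1, 2)   # (batch, W/4, d_model)

        length = previous_chars.size(1)
        positions = torch.arange(length, device=previous_chars.device)
        targets = self.char_embedding(previous_chars) + self.position_embedding(positions)

        # Causal mask: True marks positions a character may NOT attend to.
        causal_mask = torch.triu(
            torch.ones(length, length, dtype=torch.bool, device=previous_chars.device),
            diagonal=1,
        )
        decoded = self.decoder(targets, memory, tgt_mask=causal_mask)
        return self.classifier(decoded)                 # (batch, length, vocab_size)


model = TinyLineRecognizer(vocab_size=80)
images = torch.randn(2, 1, 32, 256)                     # two fake 32x256 line images
previous_chars = torch.randint(0, 80, (2, 10))          # ten already-decoded characters
logits = model(images, previous_chars)
print(logits.shape)  # torch.Size([2, 10, 80])
```

The decoder predicts one character distribution per position while attending to the image columns, which is the general pattern for attention-based line recognition. The actual MLT models may differ in every detail.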

The software is written in Python and relies mainly on the PyTorch library to train and evaluate models for handwriting recognition. The code includes neural network architectures based on the Transformer model, pre-trained models for direct use, and the associated code for training and/or prediction. The models were trained on both real data and synthetic text-line images. The Python code used to generate the synthetic handwritten text-line images, called Synthetic Text line Image Generator, is also available free of charge under conditions.
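This page does not say which metric is used to evaluate the models. A common choice for handwriting recognition is the character error rate (CER), so the sketch below shows how it could be computed in plain Python. The function names and the sample strings are assumptions made for illustration and are not part of the MLT code.

```python
def edit_distance(reference: str, hypothesis: str) -> int:
    """Levenshtein distance: insertions, deletions and substitutions each cost 1."""
    previous = list(range(len(hypothesis) + 1))
    for i, ref_char in enumerate(reference, start=1):
        current = [i]
        for j, hyp_char in enumerate(hypothesis, start=1):
            substitution = previous[j - 1] + (ref_char != hyp_char)
            current.append(min(previous[j] + 1, current[j - 1] + 1, substitution))
        previous = current
    return previous[-1]


def character_error_rate(references, hypotheses) -> float:
    """Total edit distance divided by the total number of reference characters."""
    total_edits = sum(edit_distance(r, h) for r, h in zip(references, hypotheses))
    total_chars = sum(len(r) for r in references)
    return total_edits / total_chars if total_chars else 0.0


# Hypothetical ground-truth lines and model predictions.
references = ["bonjour", "monde"]
hypotheses = ["bonjonr", "mode"]
cer = character_error_rate(references, hypotheses)
print(f"CER = {cer:.4f}")  # CER = 0.1667
```

Dividing the summed edits by the summed reference lengths weights long lines more than short ones. Averaging per-line rates is another convention and gives slightly different numbers.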

Examples of synthetic text-line images generated for historical documents.

How to get the MLT?

To use the MLT, you must agree to comply with the terms of the CLIC license (in particular, use for research purposes only) and to cite the article below:
Barrere, K., Soullard, Y., Lemaitre, A., & Coüasnon, B. (2024). Training transformer architectures on few annotated data: an application to historical handwritten text recognition. International Journal on Document Analysis and Recognition (IJDAR), 1-14.

    ℹ️ To access the Synthetic Text line Image Generator: please follow this link.
