Models for Handwritten Text Recognition
Collection
This collection contains models that we created at the National Archives for HTR • 6 items • Updated • 7
How to use Riksarkivet/rtmdet_regions with HTRflow:
# CLI usage # see docs: https://ai-riksarkivet.github.io/htrflow/latest/getting_started/quick_start.html htrflow pipeline <path/to/pipeline.yaml> <path/to/image>
# Python usage
from htrflow.pipeline.pipeline import Pipeline
from htrflow.pipeline.steps import Task
from htrflow.models.framework.model import ModelClass
pipeline = Pipeline(
[
Task(
ModelClass, {"model": "Riksarkivet/rtmdet_regions"}, {}
),
])RTMDet is both an instance segmentation and object detection model from OpenMMLab and was trained using MMDetection. This RTMDet model is fine-tuned to segment text regions within the documents, which enables a pre-localization text-line regions, which is a crucial step for current text-recognition models work at the text-line level.
#WIP
(WIP)
(WIP)
If you would like to learn more about the Swedish National Archives HTR pipeline or access the training data, please refer to the following resources: