For Archivists ·
What you'll accomplish
A scanned handwritten document that would take hours to transcribe manually becomes a searchable text file in minutes. Transkribus uses AI models trained on millions of historical pages and handles 19th-century script, cursive correspondence, and even older scripts with 85–95% accuracy. This guide gets you set up and through your first transcription.
What you'll need
Go to transkribus.eu and click Sign Up. Enter your name, email, and create a password.
Check your email for the verification link and confirm your account.
What you should see: You're taken to the Transkribus dashboard, which shows "My Collections" with an empty list.
Troubleshooting: If the email doesn't arrive in 5 minutes, check your spam folder. Academic email domains sometimes filter transactional email.
In the Transkribus web interface, click New Collection. Give it a descriptive name (e.g., "Smith Family Papers - Correspondence" or "Board Minutes 1920-1950").
What you should see: Your new collection appears in the list with 0 documents.
Click on your collection to open it. Click Upload Documents and drag your scanned image files into the upload window. You can upload individual images or group them as a multi-page document.
What you should see: Progress bar during upload, then the document appears in your collection with a thumbnail of the first page.
Troubleshooting: If upload fails, check that files are JPG or TIFF format and under 50MB each. Very large files may need to be compressed first.
Open your document by clicking its thumbnail. You'll see the document image with an empty transcription panel on the right.
Click the Run HTR+ button in the toolbar (or look for "Automatic Text Recognition" in the Tools menu).
What you should see: A progress indicator. Processing takes 30–90 seconds per page. When complete, text appears in the transcription panel on the right side of the screen.
Troubleshooting: If no text appears, the model may not match the document's script style. Try a different pre-trained model. Very degraded or extremely small handwriting may not transcribe well with any model.
The AI transcription will have errors, typically 5–15% of words for clear documents and more for degraded or unusual handwriting. Review the transcription panel against the image and correct errors.
What you should see: Lines where the AI is uncertain are highlighted in yellow. The verification process goes much faster than transcribing from scratch.
When you've reviewed and corrected the transcription, export it in the format you need.
What you should see: A file downloads to your computer containing the transcribed text.
Use these in Claude after exporting Transkribus transcriptions for final cleanup:
Clean up obvious HTR errors in this historical document transcription while preserving original spelling and punctuation: [paste text]Identify all proper nouns (people, places, organizations) mentioned in this transcription: [paste text]Write a 2-sentence summary of this letter for a metadata description: [paste transcription]Identify the date and main subject of this letter: [paste transcription]List all dates mentioned in this document and what events they refer to: [paste transcription]