ProfHacker

Teaching, tech, and productivity.

Crowdsourcing Transcription: FromThePage and Scripto

By Konrad M. Lawson January 23, 2012

The 2012 Annual Meeting of the American Historical Association was packed with sessions on digital history and the digital humanities. In one time slot I counted four sessions on digital archives, online tools, or other technology-related topics. One of the panels I especially enjoyed was Crowdsourcing History: Collaborative Online Transcription and Archives (Tweets available at #session138 #aha2012), which covered some of the projects and tools that involve massive crowdsourcing of the transcription of handwritten documents. Presenters included Valerie Wallace, who talked about the Transcribe Bentham project, James Ginther introducing T-PEN, Tim Sherratt on Invisible Australians, and Chris Lintott on Zooniverse and the Old Weather project. You can read more about the panel at the website created for it: Crowdsourcing History.

These were all inspiring projects of an impressive scale, and all of them are success stories in terms of the rewards they have reaped from crowdsourcing. For those of us in possession of our own stack of handwritten documents needing transcription, the question is how we might create our own online interface that hosts images of the documents and lets various users transcribe them, while still allowing us to maintain some control over the results. Two tools introduced during the session, FromThePage and Scripto, both meet this need.

Scripto

The documentary transcription tool Scripto is the newest creation from the wonderful developers of Zotero and Omeka at the Center for History and New Media. Instead of creating a fully self-contained content management system to manage the documents to be transcribed, they have, very wisely in my opinion, decided to make a library of scripts that can tie into existing platforms such as Omeka, Drupal, and soon WordPress. This division of labor lets them focus on the key task, the transcription features, while tying in smoothly with a much richer platform that hosts the documents and other materials and serves as the front end for a project. Like other CHNM projects such as Omeka, Scripto was extracted from an existing application, the Papers of the War Department, where you can see an earlier version of its features in action.

The project is in its early stages, but I was impressed by the two demos I have seen Sharon Leon give of Scripto, at THATCamp and this time at AHA. They have already incorporated some great features for manipulating the original document, and a wiki-like interface that permits discussion of the transcription. I think it is safe to say that with the professionalism and solid institutional backing of CHNM behind it, Scripto is here to stay and will develop into a focused tool that is easy to install, use, and maintain. Learn more about the tool on their home page, or download the source from GitHub.

FromThePage

FromThePage first came to my attention on the DPLA mailing list, where it received a number of compliments. The developer of this collaborative transcription tool, Ben Brumfield, did a great job demonstrating his platform, which provides a clean and simple interface for viewing, transcribing, and text coding of keywords, people, and places in a collection of documents. The open source software, which can be hosted directly at fromthepage.com or set up on your own Ruby on Rails-friendly server, is clearly a labor of love built by someone who set out to solve his own problem by developing a tool which many of us have only dreamed of. I understand that he is looking for collaborators and institutional support for the software going forward. With an already powerful and functioning tool on offer, if I had to recommend one new project from the past year, I can’t think of one better than FromThePage.

The killer feature that takes FromThePage well beyond other transcription interfaces I have seen, including Scripto, is the powerful yet simple wiki-like annotation and indexing feature. Using double brackets, or an optional automatic suggested markup feature, plain transcriptions immediately become “tagged” (though only for items explicit in a text, rather than arbitrary tags) in a truly powerful way, allowing visitors to find documents through an index of subjects, places, or people - or jump immediately from linked subjects within a text to others like it in what becomes a rich hypertext environment.
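To make the double-bracket idea concrete, here is a minimal Python sketch of how bracketed terms in a transcription could be turned into a subject index. It is not FromThePage's own code (the tool itself runs on Ruby on Rails); the `[[Subject|display text]]` pipe form, the function names, and the sample diary pages are all assumptions made up for illustration.

```python
import re
from collections import defaultdict

# Matches [[Subject]] or [[Subject|display text]].
# The pipe form is an assumption for this sketch, not a documented syntax.
LINK = re.compile(r"\[\[([^\[\]|]+)(?:\|([^\[\]]+))?\]\]")


def build_index(pages):
    """Map each bracketed subject to the sorted list of page ids that mention it."""
    index = defaultdict(set)
    for page_id, text in pages.items():
        for match in LINK.finditer(text):
            index[match.group(1).strip()].add(page_id)
    return {subject: sorted(ids) for subject, ids in index.items()}


def plain_text(text):
    """Strip the markup, keeping the display text where one is given."""
    return LINK.sub(lambda m: (m.group(2) or m.group(1)).strip(), text)


# Hypothetical transcribed pages, invented for this example.
pages = {
    "diary-p1": "Went to [[Richmond]] with [[John Smith|Mr. Smith]].",
    "diary-p2": "[[John Smith]] called; rain all day.",
}

index = build_index(pages)
for subject, page_ids in sorted(index.items()):
    print(subject, "->", ", ".join(page_ids))
```

Because the index is keyed on the canonical subject rather than on the wording in the text, "Mr. Smith" on one page and "John Smith" on another end up under the same entry, which is what lets a reader jump from one mention to every other page about the same person.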

Add version control and integration with documents hosted in the Internet Archive, and I’m truly amazed at what Ben has put together with what seems to be relatively little outside support. I would love to see this as a simple “one-click install” tool that anyone with an off-the-shelf hosting setup could add to their own project. With a few more developers working with him or some solid institutional support, this project has huge potential. Visit the project homepage for some examples of projects that have used the platform, or watch Ben’s screencast describing some of the features.

Ben has put together a great list of online transcription projects, but are there other crowdsourced transcription tools and services, beyond those of the panel participants, which deserve mention?

Image: Diary, a Creative Commons Attribution (2.0) image from bdorfman’s photostream
