Data Processing and Analysis

Transcribe audio and video

Interviews are usually recorded as video or audio. The library offers workshops and guidance on tools for transcribing interviews and analysing the transcripts.

Courses in transcribing

Transcribe and analyse interviews

oTranscribe

ELAN

ELAN is an open-source transcription tool for audio and video in which you can use separate tiers for different speakers, or for speech and gestures. It supports segmentation and the registration of silences, which makes it a good option for conversation analysis and the analysis of sign language, among other things.

Automatic transcription

Interviews contain personal data, which restricts the software that is appropriate for transcribing the recordings.

If the interview material does not contain sensitive personal data, a suitable transcription tool is available in Canvas Studio. Uppsala University students, researchers and staff can find it under the Studio tab when logged into Studium. The tool is intended for captioning video but can also be used to transcribe audio and video. Keep in mind that automatic transcription is not flawless, so the result needs to be edited manually. The transcript is delivered in a subtitle format; see the instructions below for how to remove everything but the actual text.
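For those who prefer to script the clean-up, the following is a minimal Python sketch of how cue numbers and timestamps could be stripped from a subtitle file. It assumes the export is in SRT or WebVTT format, which this guide does not specify. The function name subtitles_to_text is hypothetical, and the sketch does not handle WebVTT NOTE blocks or other less common features.

```python
import re

# Cue timing lines, e.g. "00:00:01,000 --> 00:00:04,000" (SRT)
# or "00:01.000 --> 00:04.000" (WebVTT). Assumed formats, see lead-in.
TIMING = re.compile(
    r"^(?:\d{1,2}:)?\d{2}:\d{2}[.,]\d{3}\s+-->\s+(?:\d{1,2}:)?\d{2}:\d{2}[.,]\d{3}"
)
# Inline markup such as <i>...</i> or WebVTT voice tags like <v Anna>.
TAG = re.compile(r"<[^>]+>")


def subtitles_to_text(raw: str) -> str:
    """Return only the spoken text from SRT/WebVTT content (hypothetical helper)."""
    lines = [line.strip() for line in raw.splitlines()]
    kept = []
    for i, line in enumerate(lines):
        # Skip blank separators and the WebVTT file header.
        if not line or line.startswith("WEBVTT"):
            continue
        # Skip timing lines.
        if TIMING.match(line):
            continue
        # Skip a cue counter: a digits-only line directly followed by a timing line.
        following = lines[i + 1] if i + 1 < len(lines) else ""
        if line.isdigit() and TIMING.match(following):
            continue
        text = TAG.sub("", line).strip()
        if text:
            kept.append(text)
    # Join everything into one running text; edit paragraph breaks by hand afterwards.
    return " ".join(kept)


sample = (
    "1\n00:00:01,000 --> 00:00:04,000\nHello and welcome.\n\n"
    "2\n00:00:04,500 --> 00:00:06,000\nThank you\nfor having me.\n"
)
print(subtitles_to_text(sample))
```

To use it on a real file, read the file contents into a string first (for example with pathlib.Path("interview.srt").read_text(encoding="utf-8"), where the file name is only an illustration) and pass that string to the function. The output is a single block of text, so speaker turns and paragraph breaks still need to be added manually.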

Uppsala University currently has no procured software for transcribing material that contains sensitive personal data. Researchers can apply for a project at UPPMAX and use OpenAI Whisper within that project, free of charge. Whisper can be used for sensitive data as long as it is run on Bianca (NAISS SENS) at UPPMAX. For non-sensitive data, Snowy on UPPMAX can be used. No web-based versions of Whisper are approved for transcribing interviews.

Researchers can also purchase the Sunet speech-to-text service directly from Amberscript, provided that they also sign a personal data processing agreement (DPA/PUBA) with Amberscript.