Echo Lab Style Guide
Echo Lab Style Guide
Hello, and congratulations! If you’re reading this, it means you’ve passed Echo's
rigorous transcription exams with a score that puts you in the 99% percentile of
everyone who applied.
As part of our team working to provide the highest accuracy and with genuine care
for accessibility, we value your feedback on everything from platform features to
functionality. Our work surpasses traditional transcription to provide quality
captioning and ensure equitable access, which is why our contractors make 2-3x
more than other transcription roles.
You have been offered a freelance contract with the potential to work up to 40 hours
a week as jobs become available. With more universities requesting
captioning/transcription services each week, jobs will continue to be added to the
dashboard daily after being initially transcribed through AI. The Development team
is continually working to increase the accuracy of capturing with AI so you have a
complete transcription to work with.
The goal of the style guide is to proactively answer any questions you might have as
a new transcriber about formatting so, together, we can provide a consistent
experience to the schools we work with. We want them to feel that their 100th video
was captioned with the same care and attention as their very first - something that
requires tremendous coordination between all of us, at scale. If you have any
questions or clarifications on the items below, don’t hesitate to reach out to the
internal team directly on Discord or drop a question in the community channels.
TABLE OF CONTENTS
Legal Requirements
1
Quality Captioning
Speaker Identification
Atmospherics
Common Captions
Helpful Hacks
FAQ
Glossary
2
On the other hand, transcription involves converting spoken words from audio or
video content into written text. It aims to provide a textual representation of the
spoken content, making it accessible to individuals who may have difficulty hearing
or understanding the audio.
JOB RESPONSIBILITIES
HELPFUL RESOURCES
● Using a mouse and headset are HIGHLY recommended for synching captions
with audio
3
● The Google Image reverse feature can assist in identifying speakers or other
content displayed in the video
● Find and Replace extensions on your browser can improve speed for editing
4
LEGAL REQUIREMENTS
Accuracy: Captions must relay the speaker’s exact words with correct spelling,
punctuation, and grammar with 99% accuracy and no paraphrasing. Captions must
honor the original tone and intent of the speaker. Captions must match background
noises and other sounds to the fullest extent possible.
Time Synchronization: Captions must align with their corresponding spoken words
and sounds to the greatest extent possible. Captions must not proceed too quickly
for the viewer to read.
Program Completeness: Captions must be included from the beginning to the end
of the program to the fullest extent possible.
QUALITY CAPTIONING
Accurate: We aim for errorless captions.
Equal: The meaning and intention are completely preserved to maintain equal
access.
● Caption: the converted audio dialogue and sounds into text that appears on a
video and is synchronized with audio to provide equitable access and
meaning
● Caption Segment/Line: a transcribed line of dialogue that is less than 64-
characters
● Caption Stacking: Splitting a caption segment/line into two sections, each is
32-characters or less in length.
● Color Block: The corresponding audio recording for each caption.
● Red Line: The red line indicates the 32-character line limit. Any text over this
line should be stacked or segmented appropriately. There should be NO text
beyond the red line at any time, for any reason.
6
4.
7
5. A score of 1-2, will receive a strike against your three chances. The grader will
add robust feedback for improvement and you still have two more chances to
improve. Errors that are scored automatically as 1-2 include:
a. text or characters going over the red line (spaces over the line do not
count)
b. no or incorrect speaker tags
c. no atmospherics
d. inappropriate captioning (not captioning exactly what was said)
e. overuse of (unintelligible)
6. When a transcriber reaches three “strikes,” they will lose their freelance
contract with Echo Labs.
TURNAROUND TIME
1. Every video comes with an associated ‘time allotment’ which is the total
amount of time you have to complete a video. You will have a timer running
when accepting the job. We do this to ensure that all videos are delivered to
students within 24 hours of submission (See Time Chart).
2. If work is not finished within the time allotment (when the timer expires) any
work completed will be erased and partial payment is not possible, this is
why we’ve extended the time allotment to 600% of the video length itself.
3. You may request an extension for extenuating circumstances on the
appropriate Discord channel.
For accepting and completing a job with Echo Labs, follow these steps. We
recommend using a mouse and headset to simplify your editing.
SPEAKER IDENTIFICATION
Whether or not we can see who is speaking, we need to include accurate speaker
tags for students who use transcripts as a guide. Here’s how to prepare speaker tags
for everyone.
1. Speaker Tags in Brackets: Always put speaker tags in brackets and write the
speaker’s name in all uppercase letters, like this: [RACHEL ADAMS].
2. Finding Names: Try to find the speaker’s full name by looking on the
university website, professional sites, or doing an internet search. We also
have a section on Discord called #speaker-identification where many lecturers
are identified. Consult that before using a generic speaker tag.
3. Consistent Speaker Tags: Use the same speaker tag throughout the video.
Don’t shorten the speaker tag after the first use.
4. No Generic Labels: Don’t use labels like [SPEAKER ONE, TWO, THREE, ETC.].
5. Full Names: Use full names whenever possible. Do not use just first names if
both first and last names are available.
6. Avoid Gendered Tags: Don’t use gendered labels like [ACTOR, ACTRESS,
POSTMAN, ETC.]
7. When to Use Speaker Tags: Speaker tags are needed each time the speaker
changes. If one person keeps talking across several caption segments, you
only need to include a speaker tag in the first segment of their speech.
8. Numbering Speakers: Write out the numbers one through ten and then use
numerals for speakers 11 and above.
9. Labeling Hierarchy: Follow this order for labeling speaker tags: 1) Full Name,
2) Role (like STUDENT or INSTRUCTOR).
10. Multiple Speakers: When there are many speakers, use the correct tag for
who they are, not their order in the video. For example, if speaker one is [DR.
10
SMITH] and speaker two’s name isn’t given, label them [STUDENT ONE]. The
next speakers will be tagged by name if known, or by the next number.
11. Titles: Include titles if known, and use abbreviations like DR., MRS., MS., MR..,
etc.
● [NARRATOR] ● [PROFESSOR]
● [HOST] ● [INSTRUCTOR]
● [COMMENTATOR] ● [LECTURER]
● [INTERVIEWER] ● [FACULTY]
● [INTERVIEWEE] ● [SCHOLAR]
● [ACADEMIC]
● [EDUCATOR]
● [TEACHER]
● [ADVISOR]
● [STUDENT] ● [MODERATOR]
● [LEARNER] ● [FACILITATOR]
● [PUPIL] ● [CHAIRPERSON]
● [SCHOLAR] ● [MEDIATOR]
● [UNDERGRAD] ● [COORDINATOR]
● [GRADUATE] ● [HOST]
● [APPRENTICE] ● [PRESENTER]
● [TRAINEE] ● [DISCUSSION LEADER]
● [PARTICIPANT] ● [CONVENOR]
● [EMCEE]
● [VOICEOVER]
Special Roles ● [NEWS ANCHOR]
● [TRANSLATOR]
● [GUEST]
● [EXPERT]
● [PANELIST]
● [CONSULTANT]
11
● [KEYNOTE SPEAKER]
● [TUTOR]
● [AUDIENCE MEMBER]
● With our recent updates, it’s easier to know how long each segment should
be. The red line shows when you’ve reached 32 characters. If the caption goes
over the red line, it needs to be stacked.
● When the caption goes beyond the red line, split it into two lines. To do this,
click SHIFT + ENTER while typing. This will break the segment into two lines to
make it easier to read. The caption segment should NOT be split into more
than two lines.
● Deciding where to stack captions is very important. The goal is to stack
captions so that whole phrases, nouns, sentences, and dialogue flow are
interrupted as little as possible.
● Readability is very important when segmenting and stacking. Watch and
review the video captions to make sure the segments and caption stacks are
not divided over too many screens. This can make it hard for students to read.
When a sentence is broken into two lines of captions, it should be broken at a logical
point where speech normally pauses and follows standard grammar rules. The
following guidelines provide examples of appropriate segmentation.
Inappropriate
Appropriate
12
Mark pushed
his black truck.
Inappropriate
Appropriate
Mary scampered
under the table.
● Don't separate a person's name or a title from the name it goes with.
Inappropriate
Appropriate
Inappropriate
My cat is the
best friend I have.
Appropriate
My cat is
the best friend I have.
Inappropriate
Appropriate
13
In seconds she arrived,
and he ordered a drink.
Inappropriate
Appropriate
● Never end a sentence and start a new sentence on the same line unless
they are short, related sentences with one or two words.
Inappropriate
This is an example.
You're not supposed to do this.
Appropriate
Inappropriate
Appropriate
Task's assigned,
and (smacks hands) done
14
GENERAL GUIDELINES
1. Keep the Original Meaning: Each caption should keep the original
meaning, content, and important words from the spoken narration.
2. Edit Lightly: Remove filler words like "uh," "um," "you know," and "well...".
Leave out unnecessary words, false starts, stutters, and repetitions. Skip quick
interjections like "mm-hmm" unless it’s a direct response to a question.
Correct major pronunciation errors that make reading hard, but don’t change
the story. Keep grammar or pronunciation that is easily understood, like
“gonna.” Never censor or edit expletives; if there’s a beep sound, write (beep)
or (bleep) where the sound occurs. Never leave out special words, whole
sentences, or expletives. Don’t be too strict; if you’re unsure, don’t remove the
word(s).
And Kind of
6. Show hesitations in the captions - use ellipses (...) for pauses longer than
5 seconds and commas for shorter pauses.
8. Research proper nouns and terms for correct spelling (see the Glossary
for common spellings).
9. Use the spelling of words that are shown in the video, like on PowerPoint
presentations or written on the board.
10. Googling with some context from your video or audio, like speaker
names or academic topics, is helpful.
17
11. Caption URLs, hashtags, and social media tags using common formats:
www.el.ai/#echolabscaptions/@echolabs
13. Use a fitting sound for the censored word, like (beep).
,
Used when listing, to separate clauses, after filler words,
before quotes, when addressing someone, self-
interruptions, asides, or meaningful stuttering as part
of one’s speech pattern. No space before and one
space after.
[]
Use for speaker tags.
.!?
Used at the end of whole sentences. No space before
and one space after.
‘
Used in contractions and to indicate a possessive. No
space before or after.
‘’
Use single quotation marks for short quotes (less than
64 characters), answers, media titles, and internal
dialogue. Also, use them for emphasis when a speaker
stresses a specific term, phrase, or quote. Place
quotation marks at the beginning and end of the quote.
“”
Use for long and direct quotes (longer than 64
characters). Place quotation marks at the beginning of
each caption block with the quote and the end of the
quote.
–
Use an Em-dash/double hyphen to indicate that another
character/speaker has interrupted a speaker mid-
sentence. The em dash should be placed directly after
the speaker gets out before interruption (with no extra
space).
-
Use a single dash/hyphen for hyphenated words or
negative numbers within math equations ONLY.
…
Use when there is a significant pause in a sentence
(longer than 5 seconds) or when a speaker trails off with
a significant pause.. Use only when necessary and not
19
for sections with no sound.
:
Use to give emphasis, present dialogue, introduce lists
or text, and clarify composition titles.
;
DO NOT USE.
COMMAS
Inappropriate
Appropriate
A-N-T-I-O-N-E-T-T-E
ELLIPSES
1. Use an ellipsis when a caption has a significant pause, longer than 5 seconds.
2. Do not use an ellipsis to indicate that the sentence continues into the next
caption.
QUOTATION MARKS
1. Use quotation marks for direct quotes only, that would result in a formal
citation (article, speech, book, journal, and I quote… etc.). Do not use quotation
marks for internal monologue or dialogue.
1. Spaces should not be inserted before the ending punctuation, after opening
and before closing parentheses and brackets, or before/between/after the
periods of an ellipsis.
● A space should be inserted after the beginning music icon (♪) and
before the ending music icon(s). Example:
MUSIC
Incorrect
Correct
22
(frantic piano music)
Correct Examples
● Caption lyrics with music icons (♪). Use one music icon at the beginning
and end of each caption within a song, but use two music icons at the
end of the last line of a song. A space should be inserted after the
beginning music icon (♪) and before the ending music icon(s).
Correct Examples
♪ Amazing Grace
How sweet the sound ♪
NUMBERS
● Unless otherwise specified, spell out all numbers from one to ten,
but use numerals for all numbers beyond ten. Examples:
Inappropriate
Appropriate
Inappropriate
Appropriate
● Spell out any number that begins a sentence as well as any related
numbers. Example:
Inappropriate
50000
Appropriate
50,000
Inappropriate
Appropriate
Building 2 page 31
Channel 5 size 12
● Use numerals for location numbers such as phone numbers and zip
codes. Example:
Inappropriate
One, one, one, five, five five, one five, one five
Appropriate
(111) 555-1515
20048
DATES
● Use the numeral plus the lowercase "th," "st," or "nd" when a day of the
month is mentioned by itself (no month is referred to). Example:
Original Narration
"ninth"
Captioned As
● When the day precedes the month, use the numeral plus the lowercase
"th," "st," or "nd" if the ending is spoken. Example:
Original Narration
"seventeenth"
Captioned As
Original Narration
"nine" or "ninth"
Captioned As
● When the month, day, and year are spoken, use the numeral alone for
the day, even if an ending ("th," "st," or "nd") is spoken. Example:
Original Narration
"six" or "sixth"
Captioned As
1907
Original Narration
June of 1999
Captioned As
June of ‘99
TIME
I awoke at 5:17.
I awoke at 4 o'clock.
PERIODS OF TIME
● A decade should be Captioned As "the 1980s" (not "the 1980's") and "the
'50s" (not "the 50's").
● Do not use hyphens if a decade or century is in noun form. Example:
FRACTIONS
● Either spell out or use numerals for fractions, keeping this rule
consistent throughout the media. If using numerals, insert a space
between a whole number and its fraction. Example:
Numeral Used
Inappropriate
Appropriate
● If a fraction is used with "million," "billion," "trillion," etc., spell out the
fraction. Example:
Inappropriate
3/10ths
Appropriate
3/10
DOLLAR AMOUNTS
● Use the numeral plus "cents" or "¢" for amounts under one dollar.
Examples:
I need 15 cents.
● Use the dollar sign plus the numeral for dollar amounts under one
million. For whole-dollar amounts of one million and greater, spell out
"million," "billion," etc. Examples:
He owes $13,656,000.
● Use the word "dollar" only once for a range up to ten. Example:
● Use the dollar sign and numerals when captioning a range of currency
over ten dollars. Example:
MEASUREMENTS
I'm 5'8".
● For whole numbers, use numerals. For example, caption "3 cups of
sugar" instead of "three cups of sugar."
DECIMALS
● Spell out decimals, such as one point four or nine point nine seven.
Unless the content is math specific. Then use integers such as 1.4 or
9.97.
30
MATH & EQUATIONS
1 plus 2 equals 3.
● Follow the conventions for ordinal place. For example: “Zeroth”, “first in
line”, “tenth place”, “21st century”, “100th time”.
● For anything that’s not a number/digit or percentage, write out the full
word instead of the symbol. Example:
● Write it out as the speaker says it, following basic number conventions
and using integers instead of writing out the words (-10,3).
● Quadrants are labeled with Roman numerals, such as “quadrant IV”
● Axes and coordinate references are hyphenated as follows: x-
coordinate, y-axis.
ACRONYMS
Appropriate
Gigabytes
Inappropriate
gigs or GB
NON-LETTER SYMBOLS
Appropriate
2 pi r
2 times pi times r
Inappropriate
Twopir
● Spell out all units (joule, gram, ampere, volt, meter, pascal, kelvin, hertz,
coulomb and newton.)
● Spell out all functions such as “f of x” instead of f(x). When referring to
notations such as dy/dx (and all other related derivative references) in
calculus, engineering, etc, have the captions reflect what the speaker says
(e.g., “dy dx”, including a space in between). Example:
● Treat the subscript or superscript term the same way you might another term
like pi, tau, or sine, all together. Denote these as the speaker reads it. Example:
x2
x sub 2
xj2
● Anything to a power should be written as spoken. Use the integer plus the
lowercase "th," "st," or "nd" if the ending is spoken. Example:
9 squared
y to the 10th
33
12 to the 2nd
PROGRAMMING LANGUAGE
ATMOSPHERICS
Captions need to indicate sounds heard on screen. We call these identifiers
atmospherics. These are critical in providing visual indicators of non-verbal sounds to
viewers who may be hard of hearing.
2. The sound should add important context and meaning to the video
such as sounds that:
Convey emotion
Background music
3. Avoid adding atmospherics that are excessive and do not add meaning:
(lip smacking, mouse clicking, pen clicking, etc.).
4. Use parentheses ( ) and lowercase unless a proper noun is used for all
atmospherics, e.g. (Erin sketching).
35
5. Describe the sound or sounds heard on screen by following this
convention: noun + descriptor/verb in the present tense form, e.g. (water
boiling), (door slams).
UNCLAIMING A JOB
Currently, the only reason to unclaim a job is if a job is extremely poor audio
recording. When you encounter this, unclaim the job, and notify the IT
troubleshooter. When in doubt, please email support@el.ai first for guidance.
If there are repeated words, incorrect wording, or incorrect lyrics to music, please
correct and complete the job.
37
If you claim a job that contains all or partial audio in Spanish, please UNCLAIM the
job, using the “Foreign Language” reason, so that the job can be redirected to one of
our Spanish-speaking staff. DO NOT use an atmospheric such as (speaking Spanish).
EXCEPTIONS
2. If the spoken words are from a verifiable source, such as a quoted segment of
a text or poem, please caption as said / verified in writing.
SPANISH TEAM
● Spell out all number words in Spanish. Do not use numbers for Spanish.
39
Syncing the audio accurately with the captions is vital for understanding. The color
blocks at the bottom of the screen align the sound waves with the caption. If the
caption extends beyond the red line, it needs to be stacked to the next line and the
color blocks will be adjusted so that each caption corresponds to the accurate color
block of sound.
41
● You can access the video timeline by right-clicking the video screen, and
selecting “show all controls.”
42
Below, are several examples of inappropriate captioning and common errors that
have been submitted for review.
● This is an example where the transcriptionist did not ensure the caption block
continued with the speaker before submitting.
44
QUICK GUIDE
ITALICS (As of 4/21, there is no ability to italicize but it will be coming soon!)
1. A voice-over reading of a poem, book, play, journal, letter, etc. (This is also
quoted material, so quotation marks are also needed.)
4. The first time a new word is being defined, but do not italicize the word
thereafter.
1. When an entire caption is already in italicized format, use Roman type to set
off a word that would normally be italicized.
46
2. If there is only one person speaking throughout the program (including the
narrator), whether onscreen or offscreen, use Roman type with no italics.