KEMBAR78
MIDI Files | PDF | File Format | Tempo
0% found this document useful (0 votes)
693 views21 pages

MIDI Files

This document describes a proposed standard MIDI file format. It defines the structure of MIDI files including sequences, tracks, chunks, and event data. MIDI files can contain multiple tracks and sequences to support multiple instruments and allow transferring files between programs.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOC, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
693 views21 pages

MIDI Files

This document describes a proposed standard MIDI file format. It defines the structure of MIDI files including sequences, tracks, chunks, and event data. MIDI files can contain multiple tracks and sequences to support multiple instruments and allow transferring files between programs.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOC, PDF, TXT or read online on Scribd
You are on page 1/ 21

Description of MIDI files 1

Introduction
This describes a proposed standard MIDI file format. MIDI files contain one or
more MIDI streams, with time information for each event. Song, sequence, and
track structures, tempo and time signature information, are all supported.
Track names and other descriptive information may be stored with the MIDI data.
This format supports multiple tracks and multiple sequences so that if the user
of a program which supports multiple tracks intends to move a file to another
one, this format can allow that to happen.

This spec defines the 8-bit binary data stream used in the file. The data can
be stored in a binary file, nibbleized, 7-bit-ized for efficient MIDI
transmission, converted to Hex ASCII, or translated symbolically to a printable
text file. This spec addresses what's in the 8-bit stream.
Sequences, Tracks, Chunks: File Block Structure
Sequence files are made up of chunks. Each chunk has a 4-character type and a
32-bit length, which is the number of bytes in the chunk. On the Macintosh,
data is passed either in the data fork of a file, or on the Clipboard. (The
file type on the Macintosh for a file in this format will be "Midi".) On any
other computer, the data is simply the contents of the file. This structure
allows future chunk types to be designed which may easily be ignored if
encountered by a program written before the chunk type is introduced. Your
programs should expect alien chunks and treat them as if they weren't there.

This proposal defines two types of chunks: a header chunk and a track chunk. A
header chunk provides a minimal amount of information pertaining to the entire
MIDI file. A track chunk contains a sequential stream of MIDI data which may
contain information for up to 16 MIDI channels. The concepts of multiple
tracks, multiple MIDI outputs, patterns, sequences, and songs may all be
implemented using several track chunks.

A MIDI file always starts with a header chunk, and is followed by one or more
track chunks.

MThd <length of header data> <header data>


MTrk <length of track data> <track data>
MTrk <length of track data> <track data> ...

Track Data Format (MTrk chunk type)


The MTrk chunk type is where actual song data is stored. It is simply a stream
of MIDI events (and non-MIDI events), preceded by delta-time values. It starts
with the string 'Mtrk' followed by an unsigned 32 bit length.

Some numbers in MTrk chunks are represented in a form called a variable-length


quantity. These numbers are represented 7 bits per byte, most significant bits
first. All bytes except the last have bit 7 set, and the last byte has bit 7
clear. If the number is between 0 and 127, it is thus represented exactly as
one byte.
Description of MIDI files 2

Table 1. Examples of variable length quantities


Number Representation
(hex) (hex)
00000000 00
00000040 40
0000007F 7F
00000080 81 00
00002000 C0 00
00003FFF FF 7F
00004000 81 80 00
00100000 C0 80 00
001FFFFF FF FF 7F
00200000 81 80 80 00
08000000 C0 80 80 00
0FFFFFFF FF FF FF 7F

The largest number which is allowed is 0FFFFFFF so that the variable-length


representation must fit in 32 bits in a routine to write variable-length
numbers. Theoretically, larger numbers are possible, but 2 x 108 96ths of a
beat at a fast tempo of 500 beats per minute is four days, long enough for any
delta-time!

Here is the syntax of an MTrk chunk:

<track data> = <MTrk event>+


<MTrk event> = <delta-time> <event>

<delta-time> is stored as a variable-length quantity. It represents the amount


of time before the following event. If the first event in a track occurs at the
very beginning of a track, or if two events occur simultaneously, a delta-time
of zero is used. Delta-times are always present. (Not storing delta-times of 0
requires at least two bytes for any other value, and most delta-times aren't
zero.) Delta-time is in some fraction of a beat (or a second, for recording a
track with SMPTE times), as specified in the header chunk.

<event> = <MIDI event> | <sysex event> | <meta-event>

<MIDI event> is any MIDI channel message. Running status is used: status bytes
may be omitted after the first byte. The first event in a file must specify
status. Delta-time is not considered an event itself: it is an integral part
of the specification. Notice that running status occurs across delta-times.

<meta-event> specifies non-MIDI information useful to this format or to


sequencers, with this syntax:

FF <type> <length> <bytes>

All meta-events begin with FF, then have an event type byte (which is always
less than 128), and then have the length of the data stored as a variable-length
quantity, and then the data itself. If there is no data, the length is 0. As
with sysex events, running status is not allowed. As with chunks, future meta-
events may be designed which may not be known to existing programs, so programs
must properly ignore meta-events which they do not recognize, and indeed, should
expect to see them. New for 0.06: programs must never ignore the length of a
meta-event which they do recognize, and they shouldn't be surprised if it's
bigger than they expected. If so, they must ignore everything past what they
Description of MIDI files 3

know about. However, they must not add anything of their own to the end of a
meta-event.

<sysex event> is used to specify a MIDI system exclusive message, or as an


"escape" to specify any arbitrary bytes to be transmitted. Unfortunately, some
synthesizer manufacturers specify that their system exclusive messages are to be
transmitted as little packets. Each packet is only part of an entire
syntactical system exclusive message, but the times they are transmitted at are
important. Examples of this are the bytes sent in a CZ patch dump, or the FB-
01's "system exclusive mode" in which microtonal data can be transmitted. To be
able to handle situations like these, two forms of <sysex event> are provided:

F0 <length> <bytes to be transmitted after F0>


F7 <length> <all bytes to be transmitted>

In both cases, <length> is stored as a variable-length quantity. It is equal to


the number of bytes following it, not including itself or the message type (F0
or F7), but all the bytes which follow, including any F7 at the end which is
intended to be transmitted. The first form, with the F0 code, is used for
syntactically complete system exclusive messages, or the first packet an a
series Q that is, messages in which the F0 should be transmitted. The second
form is used for the remainder of the packets within a syntactic sysex message,
which do not begin with F0. Of course, the F7 is not considered part of the
system exclusive message. Of course, just as in MIDI, running status is not
allowed, in this case because the length is stored as a variable-length quantity
which may or may not start with bit 7 set.

(New to 0.06) A syntactic system exclusive message must always end with an F7,
even if the real-life device didn't send one, so that you know when you've
reached the end of an entire sysex message without looking ahead to the next
event in the MIDI file. This principle is repeated and illustrated in the
paragraphs below.

The vast majority of system exclusive messages will just use the F0 format. For
instance, the transmitted message F0 43 12 00 07 F7 would be stored in a MIDI
file as F0 05 43 12 00 07 F7. As mentioned above, it is required to include the
F7 at the end so that the reader of the MIDI file knows that it has read the
entire message.

For special situations when a single system exclusive message is split up, with
parts of it being transmitted at different times, such as in a Casio CZ patch
transfer, or the FB-01's "system exclusive mode", the F7 form of sysex event is
used for each packet except the first. None of the packets would end with an F7
except the last one, which must end with an F7. There also must not be any
transmittable MIDI events in- between the packets of a multi-packet system
exclusive message. Here is an example: suppose the bytes F0 43 12 00 were to
be sent, followed by a 200-tick delay, followed by the bytes 43 12 00 43 12 00,
followed by a 100-tick delay, followed by the bytes 43 12 00 F7, this would be
in the MIDI File:

F0 03 43 12 00
81 48 200-tick delta-time
F7 06 43 12 00 43 12 00
64 100-tick delta-time
F7 04 43 12 00 F7
Description of MIDI files 4

The F7 event may also be used as an "escape" to transmit any bytes whatsoever,
including real-time bytes, song pointer, or MIDI Time Code, which are not
permitted normally in this specification. No effort should be made to interpret
the bytes used in this way. Since a system exclusive message is not being
transmitted, it is not necessary or appropriate to end the F7 event with an F7
in this case.

Header Chunk
The header chunk at the beginning of the file specifies some basic information
about the data in the file. The data section contains three 16-bit words,
stored high byte first (of course). Here's the syntax of the complete chunk:

<chunk type> <length> <format> <ntrks> <division>

As described above, <chunk type> is the four ASCII characters 'MThd'; <length>
is a 32-bit representation of the number 6 (high byte first). The first word,
format, specifies the overall organization of the file. Only three values of
format are specified:

0. the file contains a single multi-channel track


1. the file contains one or more simultaneous tracks (or MIDI outputs) of a
sequence
2. the file contains one or more sequentially independent single-track patterns

The next word, ntrks, is the number of track chunks in the file. The third
word, division, is the division of a quarter-note represented by the delta-
times in the file. (If division is negative, it represents the division of a
second represented by the delta-times in the file, so that the track can
represent events occurring in actual time instead of metrical time. It is
represented in the following way: the upper byte is one of the four values -24,
-25, -29, or -30, corresponding to the four standard SMPTE and MIDI time code
formats, and represents the number of frames per second. The second byte
(stored positive) is the resolution within a frame: typical values may be 4
(MIDI time code resolution), 8, 10, 80 (bit resolution), or 100. This system
allows exact specification of time-code-based tracks, but also allows
millisecond-based tracks by specifying 25 frames/sec and a resolution of 40
units per frame.)

Format 0, that is, one multi-channel track, is the most interchangeable


representation of data. One application of MIDI files is a simple single-track
player in a program which needs to make synthesizers make sounds, but which is
primarily concerned with something else such as mixers or sound effect boxes.
It is very desirable to be able to produce such a format, even if your program
is track-based, in order to work with these simple programs. On the other hand,
perhaps someone will write a format conversion from format 1 to format 0 which
might be so easy to use in some setting that it would save you the trouble of
putting it into your program.

Programs which support several simultaneous tracks should be able to save and
read data in format 1, a vertically one-dimensional form, that is, as a
collection of tracks. Programs which support several independent patterns
should be able to save and read data in format 2, a horizontally one-dimensional
form. Providing these minimum capabilities will ensure maximum
interchangeability.
Description of MIDI files 5

MIDI files can express tempo and time signature, and they have been chosen to do
so for transferring tempo maps from one device to another. For a format 0 file,
the tempo will be scattered through the track and the tempo map reader should
ignore the intervening events; for a format 1 file, the tempo map must (starting
in 0.04) be stored as the first track. It is polite to a tempo map reader to
offer your user the ability to make a format 0 file with just the tempo, unless
you can use format 1.

All MIDI files should specify tempo and time signature. If they don't, the time
signature is assumed to be 4/4, and the tempo 120 beats per minute. In format
0, these meta-events should occur at least at the beginning of the single multi-
channel track. In format 1, these meta- events should be contained in the first
track. In format 2, each of the temporally independent patterns should contain
at least initial time signature and tempo information.

We may decide to define other format IDs to support other structures. A program
reading an unfamiliar format ID should return an error to the user rather than
trying to read further.

Meta-Events
A few meta-events are defined herein. It is not required for every program to
support every meta-event. Meta-events initially defined include:
FF 00 02 ssss Sequence Number
This optional event, which must occur at the beginning of a track, before any
nonzero delta-times, and before any transmittable MIDI events, specifies the
number of a sequence. The number in this track corresponds to the sequence
number in the new Cue message discussed at the summer 1987 MMA meeting. In a
format 2 MIDI file, it is used to identify each "pattern" so that a "song"
sequence using the Cue message to refer to the patterns. If the ID numbers are
omitted, the sequences' locations in order in the file are used as defaults. In
a format 0 or 1 MIDI file, which only contain one sequence, this number should
be contained in the first (or only) track. If transfer of several multitrack
sequences is required, this must be done as a group of format 1 files, each with
a different sequence number.
FF 01 len text Text Event
Any amount of text describing anything. It is a good idea to put a text event
right at the beginning of a track, with the name of the track, a description of
its intended orchestration, and any other information which the user wants to
put there. Text events may also occur at other times in a track, to be used as
lyrics, or descriptions of cue points. The text in this event should be
printable ASCII characters for maximum interchange. However, other character
codes using the high-order bit may be used for interchange of files between
different programs on the same computer which supports an extended character
set. Programs on a computer which does not support non-ASCII characters should
ignore those characters.

(New for 0.06 ). Meta event types 01 through 0F are reserved for various types
of text events, each of which meets the specification of text events(above) but
is used for a different purpose:
FF 02 len text Copyright Notice
Contains a copyright notice as printable ASCII text. The notice should contain
the characters (C), the year of the copyright, and the owner of the copyright.
If several pieces of music are in the same MIDI file, all of the copyright
Description of MIDI files 6

notices should be placed together in this event so that it will be at the


beginning of the file. This event should be the first event in the first track
chunk, at time 0.
FF 03 len text Sequence/Track Name
If in a format 0 track, or the first track in a format 1 file, the name of the
sequence. Otherwise, the name of the track.
FF 04 len text Instrument Name
A description of the type of instrumentation to be used in that track. May be
used with the MIDI Prefix meta-event to specify which MIDI channel the
description applies to, or the channel may be specified as text in the event
itself.
FF 05 len text Lyric
A lyric to be sung. Generally, each syllable will be a separate lyric event
which begins at the event's time.
FF 06 len text Marker
Normally in a format 0 track, or the first track in a format 1 file. The name of
that point in the sequence, such as a rehearsal letter or section name ("First
Verse", etc.).
FF 07 len text Cue Point
A description of something happening on a film or video screen or stage at that
point in the musical score ("Car crashes into house", "curtain opens", "she
slaps his face", etc.)
FF 2F 00 End of Track
This event is not optional. It is included so that an exact ending point may be
specified for the track, so that it has an exact length, which is necessary for
tracks which are looped or concatenated.
FF 51 03 tttttt Set Tempo
In microseconds per MIDI quarter-note This event indicates a tempo change.
Another way of putting "microseconds per quarter-note" is "24ths of a
microsecond per MIDI clock". Representing tempos as time per beat instead of
beat per time allows absolutely exact long-term synchronization with a time-
based sync protocol such as SMPTE time code or MIDI time code. This amount of
accuracy provided by this tempo resolution allows a four-minute piece at 120
beats per minute to be accurate within 500 usec at the end of the piece.
Ideally, these events should only occur where MIDI clocks would be located Q
this convention is intended to guarantee, or at least increase the likelihood,
of compatibility with other synchronization devices so that a time
signature/tempo map stored in this format may easily be transferred to another
device.
FF 54 05 hr mn se fr ff SMPTE Offset
(New in 0.06 - SMPTE Format specification) This event, if present, designates
the SMPTE time at which the track chunk is supposed to start. It should be
present at the beginning of the track, that is, before any nonzero delta-times,
and before any transmittable MIDI events. The hour must be encoded with the
SMPTE format, just as it is in MIDI Time Code. In a format 1 file, the SMPTE
Offset must be stored with the tempo map, and has no meaning in any of the other
tracks. The ff field contains fractional frames, in 100ths of a frame, even in
SMPTE-based tracks which specify a different frame subdivision for delta-times.
Description of MIDI files 7

FF 58 04 nn dd cc bb Time Signature
The time signature is expressed as four numbers. nn and dd represent the
numerator and denominator of the time signature as it would be notated. The
denominator is a negative power of two: 2 represents a quarter-note, 3
represents an eighth-note, etc. The cc parameter expresses the number of MIDI
clocks in a metronome click. The bb parameter expresses the number of notated
32nd-notes in a MIDI quarter- note (24 MIDI Clocks). This was added because
there are already multiple programs which allow the user to specify that what
MIDI thinks of as a quarter-note (24 clocks) is to be notated as, or related to
in terms of, something else.

Therefore, the complete event for 6/8 time, where the metronome clicks every
three eighth-notes, but there are 24 clocks per quarter-note, 72 to the bar,
would be (in hex):
FF 58 04 06 03 24 08
That is, 6/8 time (8 is 2 to the 3rd power, so this is 06 03), 32 MIDI clocks
per dotted-quarter (24 hex!), and eight notated 32nd-notes per MIDI quarter
note.

FF 59 02 sf mi Key Signature
Sf mi Key signature
sf = -7 7 flats
sf = -1 1 flat
sf = 0 key of C
sf = 1 1 sharp
sf = 7 7 sharps
mi = 0 major key
mi = 1 minor key
FF 7F len data Sequencer-Specific Meta-Event
Special requirements for particular sequencers may use this event type: the
first byte or bytes of data is a manufacturer ID. However, as this is an
interchange format, growth of the spec proper is preferred to use of this event
type. This type of event may be used by a sequencer which elects to use this as
its only file format; sequencers with their established feature-specific formats
should probably stick to the standard features when using this format.

More on Tempo
The MIDI file format's Tempo Meta Event expresses tempo as "the amount of time
(ie, microseconds) per quarter note". For example, if a Tempo Meta-Event
contains the 3 bytes of 07 A1 20, then each quarter note should be 0x07A120 (or
500,000) microseconds long.
BPM
Normally, musicians express tempo as "the amount of quarter notes in every
minute (ie, time period)". This is the opposite of the way that the MIDI file
format expresses it.

When musicians refer to a "beat" in terms of tempo, they are referring to a


quarter note (ie, a quarter note is always 1 beat when talking about tempo,
regardless of the time signature. Yes, it's a bit confusing to non-musicians
that the time signature's "beat" may not be the same thing as the tempo's "beat"
-- it won't be unless the time signature's beat also happens to be a quarter
note. But that's the traditional definition of BPM tempo). To a musician, tempo
Description of MIDI files 8

is therefore always "how many quarter notes happen during every minute".
Musicians refer to this measurement as BPM (ie, Beats Per Minute). So a tempo of
100 BPM means that a musician must be able to play 100 steady quarter notes, one
right after the other, in one minute. That's how "fast" the "musical tempo" is
at 100 BPM. It's very important that you understand the concept of how a
musician expresses "musical tempo" (ie, BPM) in order to properly present tempo
settings to a musician, and yet be able to relate it to how the MIDI file format
expresses tempo.

To convert the Tempo Meta-Event's tempo (ie, the 3 bytes that specify the amount
of microseconds per quarter note) to BPM:

BPM = 60,000,000/(tt tt tt)

For example, a tempo of 120 BPM = 07 A1 20 microseconds per quarter note.


So why does the MIDI file format use "time per quarter note" instead of "quarter
notes per time" to specify its tempo? Well, its easier to specify more precise
tempos with the former. With BPM, sometimes you have to deal with fractional
tempos (for example, 100.3 BPM) if you want to allow a finer resolution to the
tempo. Using microseconds to express tempo offers plenty of resolution. Also,
SMPTE is a time-based protocol (ie, it's based upon seconds, minutes, and hours,
rather than a musical tempo). Therefore it's easier to relate the MIDI file's
tempo to SMPTE timing if you express it as microseconds. Many musical devices
now use SMPTE to sync their playback.
PPQN Clock
A sequencer typically uses some internal hardware timer counting off steady time
(ie, microseconds perhaps) to generate a software "PPQN clock" that counts off
the timebase (Division, see below) "ticks". In this way, the time upon which an
event occurs can be expressed to the musician in terms of a musical
bar:beat:PPQN-tick rather than how many microseconds from the start of the
playback. Remember that musicians always think in terms of a beat, not the
passage of seconds, minutes, etc.

As mentioned, the microsecond tempo value tells you how long each one of your
sequencer's "quarter notes" should be. From here, you can figure out how long
each one of your sequencer's PPQN clocks should be by dividing that microsecond
value by your MIDI file's division. For example, if your MIDI file's Division is
96 PPQN, then that means that each of your sequencer's PPQN clock ticks at the
above tempo should be 500,000 / 96 (or 5,208.3) microseconds long (ie, there
should be 5,208.3 microseconds inbetween each PPQN clock tick in order to yield
a tempo of 120 BPM at 96 PPQN. And there should always be 96 of these clock
ticks in each quarter note, 48 ticks in each eighth note, 24 ticks in each
sixteenth, etc).

Note that you can have any timebase at any tempo. For example, you can have a 96
PPQN file playing at 100 BPM just as you can have a 192 PPQN file playing at 100
BPM. You can also have a 96 PPQN file playing at either 100 BPM or 120 BPM.
Timebase and tempo are two entirely separate quantities. Of course, they both
are needed when you setup your hardware timer (ie, when you set how many
microseconds are in each PPQN tick). And of course, at slower tempos, your PPQN
clock tick is going to be longer than at faster tempos.
MIDI Clock
MIDI clock bytes are sent over MIDI, in order to sync the playback of 2 devices
(ie, one device is generating MIDI clocks at its current tempo which it
internally counts off, and the other device is syncing its playback to the
Description of MIDI files 9

receipt of these bytes). Unlike with SMPTE frames, MIDI clock bytes are sent at
a rate related to the musical tempo.

Since there are 24 MIDI Clocks in every quarter note, the length of a MIDI Clock
(ie, time inbetween each MIDI Clock message) is the microsecond tempo divided by
24. In the above example, that would be 500,000/24, or 20,833.3 microseconds in
every MIDI Clock. Alternately, you can relate this to your timebase (ie, PPQN
clock). If you have 96 PPQN, then that means that a MIDI Clock byte must occur
every 96 / 24 (ie, 4) PPQN clocks.
SMPTE
SMPTE counts off the passage of time in terms of seconds, minutes, and hours
(ie, the way that non-musicians count time). It also breaks down the seconds
into smaller units called "frames". The movie industry created SMPTE, and they
adopted 4 different frame rates. You can divide a second into 24, 25, 29, or 30
frames. Later on, even finer resolution was needed by musical devices, and so
each frame was broken down into "subframes". So, SMPTE is not directly related
to musical tempo. SMPTE time doesn't vary with "musical tempo".

Many devices use SMPTE to sync their playback. If you need to sychronize with
such a device, then you may need to deal with SMPTE timing. Of course, you're
probably still going to have to maintain some sort of PPQN clock, based upon the
passing SMPTE subframes, so that the user can adjust the tempo of the playback
in terms of BPM, and can consider the time of each event in terms of
bar:beat:tick. But since SMPTE doesn't directly relate to musical tempo, you
have to interpolate (ie, calculate) your PPQN clocks from the passing of
subframes/frames/seconds/minutes/hours (just as we previously calculated the
PPQN clock from a hardware timer counting off microseconds).

Let's take the easy example of 25 Frames and 40 SubFrames. As previously


mentioned in the discussion of Division, this is analogous to millisecond based
timing because you have 1,000 SMPTE subframes per second. (You have 25 frames
per second. Each second is divided up into 40 subframes, and you therefore have
25 * 40 subframes per second. And remember that 1,000 milliseconds are also in
every second). Every millisecond therefore means that another subframe has
passed (and vice versa). Every time you count off 40 subframes, a SMPTE frame
has passed (and vice versa). Etc.

Let's assume you desire 96 PPQN and a tempo of 500,000 microseconds. Considering
that with 25-40 Frame-SubFrame SMPTE timing 1 millisecond = 1 subframe (and
remember that 1 millisecond = 1,000 microseconds), there should be 500,000 /
1,000 (ie, 500) subframes per quarter note. Since you have 96 PPQN in every
quarter note, then every PPQN ends up being 500 / 96 subframes long, or 5.2083
milliseconds (ie, there's how we end up with that 5,208.3 microseconds PPQN
clock tick just as we did above in discussing PPQN clock). And since 1
millisecond = 1 subframe, every PPQN clock tick also equals 5.2083 subframes at
the above tempo and timebase.
Formulas
BPM = 60,000,000/MicroTempo
MicrosPerPPQN = MicroTempo/TimeBase
MicrosPerMIDIClock = MicroTempo/24
PPQNPerMIDIClock = TimeBase/24
MicrosPerSubFrame = 1000000 * Frames * SubFrames
SubFramesPerQuarterNote = MicroTempo/(Frames * SubFrames)
SubFramesPerPPQN = SubFramesPerQuarterNote/TimeBase
MicrosPerPPQN = SubFramesPerPPQN * Frames * SubFrames
Description of MIDI files 10

Division
The last two bytes of MThd indicate how many Pulses (i.e. clocks) Per Quarter
Note (abbreviated as PPQN) resolution the time-stamps are based upon, Division.
For example, if your sequencer has 96 ppqn, this field would be (in hex):
00 60

NOTE: The 2 bytes that make up the Division are stored in (Motorola) "Big
Endian" byte order, not (Intel) "Little Endian" reverse byte order.

Alternately, if the first byte of Division is negative, then this represents the
division of a second that the time-stamps are based upon. The first byte will be
-24, -25, -29, or -30, corresponding to the 4 SMPTE standards representing
frames per second. The second byte (a positive number) is the resolution within
a frame (ie, subframe). Typical values may be 4 (MIDI Time Code), 8, 10, 80
(SMPTE bit resolution), or 100. You can specify millisecond-based timing by the
data bytes of -25 and 40 subframes.

Controllers
A Controller message has a Status byte of 0xB0 to 0xBF depending upon the MIDI
channel. There are two more data bytes.

The first data byte is the Controller Number. There are 128 possible controller
numbers (ie, 0 to 127). Some numbers are defined for specific purposes. Others
are undefined, and reserved for future use.

The second byte is the "value" that the controller is to be set to.
Most controllers implement an effect even while the MIDI device is generating
sound, and the effect will be immediately noticeable. In other words, MIDI
controller messages are meant to implement various effects by a musician while
he's operating the device.

If the device is a MultiTimbral module, then each one of its Parts may respond
differently (or not at all) to a particular controller number. Each Part usually
has its own setting for every controller number, and the Part responds only to
controller messages on the same channel as that to which the Part is assigned.
So, controller messages for one Part do not affect the sound of another Part
even while that other Part is playing.

Some controllers are continuous controllers, which simply means that their value
can be set to any value within the range from 0 to 16,384 (for 14-bit
coarse/fine resolution) or 0 to 127 (for 7-bit, coarse resolution). Other
controllers are switches whose state may be either on or off. Such controllers
will usually generate only one of two values; 0 for off, and 127 for on. But, a
device should be able to respond to any received switch value from 0 to 127. If
the device implements only an "on" and "off" state, then it should regard values
of 0 to 63 as off, and any value of 64 to 127 as on.

Many (continuous) controller numbers are coarse adjustments, and have a


respective fine adjustment controller number. For example, controller #1 is the
coarse adjustment for Modulation Wheel. Using this controller number in a
message, a device's Modulation Wheel can be adjusted in large (coarse)
increments (ie, 128 steps). If finer adjustment (from a coarse setting) needs to
be made, then controller #33 is the fine adjust for Modulation Wheel. For
controllers that have coarse/fine pairs of numbers, there is thus a 14-bit
resolution to the range. In other words, the Modulation Wheel can be set from
Description of MIDI files 11

0x0000 to 0x3FFF (ie, one of 16,384 values). For this 14-bit value, bits 7 to 13
are the coarse adjust, and bits 0 to 6 are the fine adjust. For example, to set
the Modulation Wheel to 0x2005, first you have to break it up into 2 bytes (as
is done with Pitch Wheel messages). Take bits 0 to 6 and put them in a byte that
is the fine adjust. Take bits 7 to 13 and put them right-justified in a byte
that is the coarse adjust. Assuming a MIDI channel of 0, here's the coarse and
fine Mod Wheel controller messages that a device would receive (coarse adjust
first):

0xB0 0x01 0x40 ; Controller on chan 0, Mod Wheel coarse, bits 7 to 13 of


; 14-bit value right-justified (with high bit clear)
0xB0 0x33 0x05 ; Controller on chan 0, Mod Wheel fine, bits 0 to 6 of
; 14-bit value (with high bit clear)

Some devices do not implement fine adjust counterparts to coarse controllers.


For example, some devices do not implement controller #33 for Mod Wheel fine
adjust. Instead the device only recognizes and responds to the Mod Wheel coarse
controller number (#1). It is perfectly acceptable for devices to only respond
to the coarse adjustment for a controller if the device desires 7-bit (rather
than 14-bit) resolution. The device should ignore that controller's respective
fine adjust message. By the same token, if it's only desirable to make fine
adjustments to the Mod Wheel without changing its current coarse setting (or
vice versa), a device can be sent only a controller #33 message without a
preceding controller #1 message (or vice versa). Thus, if a device can respond
to both coarse and fine adjustments for a particular controller (ie, implements
the full 14-bit resolution), it should be able to deal with either the coarse or
fine controller message being sent without its counterpart following. The same
holds true for other continuous (ie, coarse/fine pairs of) controllers.

Note: In most MIDI literature, the coarse adjust is referred to with the
designation "MSB" and the fine adjust is referred to with the designation "LSB".
I prefer the terms "coarse" and "fine".

Here's a list of the defined controllers. To the left is the controller number
(ie, how the MIDI Controller message refers to a particular controller), and on
the right is its name (ie, how a human might refer to the controller). To get
more information about what a particular controller does, click on its
controller name to bring up a description. Each description shows the controller
name and number, what the range is for the third byte of the message (ie, the
"value" data byte), and what the controller does. For controllers that have
separate coarse and fine settings, both controller numbers are shown.
MIDI devices should use these controller numbers for their defined purposes, as
much as possible. For example, if the device is able to respond to Volume
controller (coarse adjustment), then it should expect that to be controller
number 7. It should not use Portamento Time controller messages to adjust
volume. That wouldn't make any sense. Other controllers, such as Foot Pedal, are
more general purpose. That pedal could be controlling the tempo on a drum box,
for example. But generally, the Foot Pedal shouldn't be used for purposes that
other controllers already are dedicated to, such as adjusting Pan position. If
there is not a defined controller number for a particular, needed purpose, a
device can use the General Purpose Sliders and Buttons, or NRPN for device
specific purposes. The device should use controller numbers 0 to 31 for coarse
adjustments, and controller numbers 32 to 63 for the respective fine
adjustments.
Description of MIDI files 12

Defined Controllers
0 Bank Select (coarse)
1 Modulation Wheel (coarse)
2 Breath controller (coarse)
4 Foot Pedal (coarse)
5 Portamento Time (coarse)
6 Data Entry (coarse)
7 Volume (coarse)
8 Balance (coarse)
10 Pan position (coarse)
11 Expression (coarse)
12 Effect Control 1 (coarse)
13 Effect Control 2 (coarse)
16 General Purpose Slider 1
17 General Purpose Slider 2
18 General Purpose Slider 3
19 General Purpose Slider 4
32 Bank Select (fine)
33 Modulation Wheel (fine)
34 Breath controller (fine)
36 Foot Pedal (fine)
37 Portamento Time (fine)
38 Data Entry (fine)
39 Volume (fine)
40 Balance (fine)
42 Pan position (fine)
43 Expression (fine)
44 Effect Control 1 (fine)
45 Effect Control 2 (fine)
64 Hold Pedal (on/off)
65 Portamento (on/off)
66 Sustenuto Pedal (on/off)
67 Soft Pedal (on/off)
68 Legato Pedal (on/off)
69 Hold 2 Pedal (on/off)
70 Sound Variation
71 Sound Timbre
72 Sound Release Time
73 Sound Attack Time
74 Sound Brightness
75 Sound Control 6
76 Sound Control 7
77 Sound Control 8
78 Sound Control 9
79 Sound Control 10
80 General Purpose Button 1 (on/off)
81 General Purpose Button 2 (on/off)
82 General Purpose Button 3 (on/off)
83 General Purpose Button 4 (on/off)
91 Effects Level
92 Tremulo Level
93 Chorus Level
94 Celeste Level
95 Phaser Level
96 Data Button increment
97 Data Button decrement
Description of MIDI files 13

98 Non-registered Parameter (fine)


99 Non-registered Parameter (coarse)
100 Registered Parameter (fine)
101 Registered Parameter (coarse)
120 All Sound Off
121 All Controllers Off
122 Local Keyboard (on/off)
123 All Notes Off
124 Omni Mode Off
125 Omni Mode On
126 Mono Operation
127 Poly Operation

MIDI Modes
Some MIDI devices can be switched in and out of Omni state. When Omni is off, a
MIDI device can only respond to Voice Category messages (ie, Status bytes of
0x80 to 0xEF) upon a limited number of channels, usually only 1. Typically, the
device allows the musician to pick one of the 16 MIDI channels that the device
will respond to. This is then referred to as the device's Base Channel. So for
example, if a device's Base Channel is set to 1, and a Voice Category message
upon channel 2 arrives at the device's MIDI IN, the device ignores that message.

Note: Virtually all modern devices allow the musician to manually choose the
Base Channel. A device may even define its own SysEx message that can change its
Base Channel. Remember that SysEx messages are of the System Common Category,
and therefore aren't (normally) tied to the Base Channel itself.

When Omni is on, a device doesn't respond to just one MIDI channel, but rather,
responds to all 16 MIDI channels. The only benefit of Omni On is that,
regardless of which channel any message is received upon, a device always
responds to the message. This mades it very foolproof for a musician to hook up
two devices and always have one device respond to the other regardless of any
MIDI channel discrepancies between the device generating the data (ie, referred
to as the transmitter) and the device receiving the data (ie, referred to as the
receiver). Of course, if the musician daisy-chains another device, and he wants
the 2 devices to play different musical parts, then he has to switch Omni Off on
both devices. Otherwise, a device with Omni On will respond to messages intended
for the other device (as well as messages intended for itself).
Note: Omni can be switched on or off with the Omni On and Omni Off controller
messages. But these messages must be received upon the device's Base Channel in
order for the device to respond to them. What this implies is that even when a
device is in Omni On state (ie, capable of responding to all 16 channels), it
still has a Base Channel for the purpose of turning Omni On or Off.

One might think that MultiTimbral devices employ Omni On. Because you typically
may choose (upto) 16 different Patches, each playing its own musical part, you
need the device to be able to respond to more than one MIDI channel so that you
can assign each Patch to a different MIDI channel. Actually, MultiTimbral
devices do not use Omni On for this purpose. Rather, the device regards itself
as having 16 separate sound modules (ie, Parts) inside of it, with each module
in Omni Off mode, and capable of being set to its own Base Channel. Usually, you
also have a "master" Base Channel which may end up having to be set the same as
one of the individual Parts. Most MultiTimbral devices offer the musician the
choice of which particular channels to use, and which to ignore (if he doesn't
need all 16 patches playing simultaneously on different channels). In this way,
he can daisy-chain another multitimbral device and use any ignored channels (on
Description of MIDI files 14

the first device) with this second device. Unfortunately, the MIDI spec has no
specific "MultiTimbral" mode message. So, a little "creative reinterpretation"
of Monophonic mode is employed, as you'll learn in a moment. In addition to Omni
On or Off, many devices can be switched between Polyphonic or Monophonic
operation.

In Polyphonic operation, a device can respond to more than one Note On upon a
given channel. In other words, it can play chords on that channel. For example,
assume that a device is responding to Voice Category messages on channel 1. If
the device receives a Note On for middle C on channel 1, it will sound that
note. If the device then receives a Note On for high C also on channel 1 (before
receiving a Note Off for middle C), the device will sound the high C as well.
Both notes will then be sounding simultaneously.

In Monophonic operation, a device can only respond to one Note On at a time upon
a given channel. It can't play chords; only single note "melodies". For example,
assume that a device is responding to Voice Category messages on channel 1. If
the device receives a Note On for middle C on channel 1, it will play that note.
If the device then receives a Note On for high C (before receiving a Note Off
for middle C), the device will automatically turn off the middle C before
playing the high C. So what's the use of forcing a device capable of playing
chords into such a Monophonic state? Well, there are lots of Monophonic
instruments in the world, for example, most brass and woodwinds. They can only
play one note at a time. If using a Trumpet Patch, a keyboard player might want
to force a device into Monophonic operation in order to better simulate a
Trumpet. Some devices have special effects that only work in Monophonic
operation such as Portamento, and smooth transition between notes (ie, skipping
the VCA attack when moving from one Note On that "overlaps" another Note On --
this is often referred to as legato and makes for a more realistic musical
performance for brass and woodwind patches). That's in theory how Mono operation
is supposed to work, but MultiTimbral devices created long after the MIDI spec
was designed, had to subvert Mono operation into Polyphonic operation in order
to come up with a "MultiTimbral mode", as you'll learn.

Note: A device can be switched between Polyphonic or Monophonic with the


Polyphonic and Monophonic controller messages. But these messages must be
received upon the device's Base Channel in order for the device to respond to
them.
Of course, a MIDI device could have Omni On and be in Polyphonic state. Or, the
device could have Omni On but be in Monophonic state. Or, the device could have
Omni Off and be in Polyphonic state. Or, the device could have Omni Off but be
in Monophonic state. There are 4 possible combinations here, and MIDI refers to
these as 4 Modes. For example, Mode 1 is the aforementioned Omni On / Polyphonic
state. Here are the 4 Modes:
Mode 1 - Omni On / Poly
The device plays all MIDI data received on all 16 MIDI channels. If a
MultiTimbral device, then it often requires the musician to manually select
which one Patch to play all 16 channels, and this setting is usually saved in
"patch memory".
Mode 2 - Omni On / Mono
The device plays only one note out of all of the MIDI data received on all 16
MIDI channels. This mode is seldom implemented because playing one note out of
all the data happening on all 16 channels is not very useful.
Description of MIDI files 15

Mode 3 - Omni Off / Poly


The device plays all MIDI data received on 1 specific MIDI channel. The musician
usually gets to choose which channel he wants that to be. If a MultiTimbral
device, then it often requires the musician to manually select which one Patch
to play that MIDI channel, and this setting is usually saved in "patch memory".
Mode 4 - Omni Off / Mono
In theory, the device plays one note at a time on 1 (or more) specific MIDI
channels. In practice, the manufacturers of MultiTimbral threw the entire
concept of Monophonic out the window, and use this for "MultiTimbral mode". On a
MultiTimbral device, this mode means that the device plays polyphonically on 1
(or more) specific MIDI channels. The Monophonic controller message has a Value
associated with it. This Value is applicable in Mode 4 (whereas it's ignored in
Mode 2), and determines how many MIDI channels are responded to. If 1, then on a
non-MultiTimbral device, this would give you a truly monophonic instrument. Of
course, on a MultiTimbral device, it gives you the same thing as Mode 3. If the
Value is 0, then a non-MultiTimbral device uses as many MIDI channels as it has
voices. So, for an 8 voice synth, it would use 8 MIDI Channels, and each of
those channels would play one note at a time. For a MultiTimbral device, if the
Value is 0, then the device uses as many MIDI channels as it has Parts. So, if a
MultiTimbral device can play only 8 patches simultaneously, then it would use 8
MIDI Channels, and each of those channels could play polyphonically.

Some devices do not support all of these modes. The device should ignore
controller messages which attempt to switch it into an unsupported state, or
switch to the next closest mode. If a device doesn't have some way of saving the
musician's choice of Mode when the unit is turned off, the device should default
to Mode 1 upon the next power up.

On final question arises. If a MultiTimbral device doesn't implement a true


monophonic mode for Mode 4, then how do you get one of its Parts to play in that
useful Monophonic state (ie, where you have Portamento and legato features)?
Well, many MultiTimbral devices allow a musician to manually enable a "Solo
Mode" per each Part. Some devices even use the Legato Pedal controller (or a
General Purpose Button controller) to enable/disable that function, so that you
can turn it on/off for each Part over MIDI.
Note: A device that can both generate MIDI messages (ie, perhaps from an
electronic piano keyboard) as well as receive MIDI messages (ie, to be played on
its internal sound circuitry), is allowed to have its transmitter set to a
different Mode and MIDI channel than its receiver, if this is desired. In fact,
on MultiTimbral devices with a keyboard, the keyboard often has to switch
between MIDI channels so that the musician can access the Parts one at a time,
without upsetting the MIDI channel assignments for those Parts.

Syncing Sequence Playback


A sequencer is a software program or hardware unit that "plays" a musical
performance complete with appropriate rhythmic and melodic inflections (ie,
plays musical notes in the context of a musical beat).

Often, it's necessary to synchronize a sequencer to some other device that is


controlling a timed playback, such as a drum box playing its internal rhythm
patterns, so that both play at the same instant and the same tempo. Several MIDI
messages are used to cue devices to start playback at a certain point in the
sequence, make sure that the devices start simultaneously, and then keep the
devices in sync until they are simultaneously stopped. One device, the master,
Description of MIDI files 16

sends these messages to the other device, the slave. The slave references its
playback to these messages.

The message that controls the playback rate (ie, ultimately tempo) is MIDI
Clock. This is sent by the master at a rate dependent upon the master's tempo.
Specifically, the master sends 24 MIDI Clocks, spaced at equal intervals, during
every quarter note interval.(12 MIDI Clocks are in an eighth note, 6 MIDI Clocks
in a 16th, etc). Therefore, when a slave device counts down the receipt of 24
MIDI Clock messages, it knows that one quarter note has passed. When the slave
counts off another 24 MIDI Clock messages, it knows that another quarter note
has passed.

For example, if a master is set at a tempo of 120 BPM (ie, there are 120 quarter
notes in every minute), the master sends a MIDI clock every 20833 microseconds.
(ie, There are 1,000,000 microseconds in a second. Therefore, there are
60,000,000 microseconds in a minute. At a tempo of 120 BPM, there are 120
quarter notes per minute. There are 24 MIDI clocks in each quarter note.
Therefore, there should be 24 * 120 MIDI Clocks per minute. So, each MIDI Clock
is sent at a rate of 60,000,000/(24 * 120) microseconds).

Alternately, if a sequencer wishes to control playback independent of tempo, it


can use Tick messages. These are sent at a rate of 1 message every 10
milliseconds. Of course, it is then up to the slave device to maintain and
update its clock based upon these messages. The slave will be doing its own
counting off of how many milliseconds are supposed to be in each "beat" at the
current tempo.

The master needs to be able to start the slave precisely when the master starts.
The master does this by sending a MIDI Start message. The MIDI Start message
alerts the slave that, upon receipt of the very next MIDI Clock message, the
slave should start the playback of its sequence. In other words, the MIDI Start
puts the slave in "play mode", and the receipt of that first MIDI Clock marks
the initial downbeat of the song (ie, MIDI Beat 0). What this means is that
(typically) the master sends out that MIDI Clock "downbeat" immediately after
the MIDI Start. (In practice, most masters allow a 1 millisecond interval
inbetween the MIDI Start and subsequent MIDI Clock messages in order to give the
slave an opportunity to prepare itself for playback). In essense, a MIDI Start
is just a warning to let the slave know that the next MIDI Clock represents the
downbeat, and playback is to start then. Of course, the slave then begins
counting off subsequent MIDI Clock messages, with every 6th being a passing 16th
note, every 12th being a passing eighth note, and every 24th being a passing
quarter note.

A master stops the slave simultaneously by sending a MIDI Stop message. The
master may then continue to send MIDI Clocks at the rate of its tempo, but the
slave should ignore these, and not advance its "song position". Of course, the
slave may use these continuing MIDI Clocks to ascertain what the master's tempo
is at all times.

Sometimes, a musician will want to start the playback point somewhere other than
at the beginning of a song (ie, he may be recording an overdub in a certain part
of the song). The master needs to tell the slave what beat to cue playback to.
The master does this by sending a Song Position Pointer message. The 2 data
bytes in a Song Position Pointer are a 14-bit value that determines the MIDI
Beat upon which to start playback. Sequences are always assumed to start on a
MIDI Beat of 0 (ie, the downbeat). Each MIDI Beat spans 6 MIDI Clocks. In other
words, each MIDI Beat is a 16th note (since there are 24 MIDI Clocks in a
Description of MIDI files 17

quarter note, therefore 4 MIDI Beats also fit in a quarter). So, a master can
sync playback to a resolution of any particular 16th note.

For example, if a Song Position value of 8 is received, then a slave should cue
playback to the third quarter note of the song. (8 MIDI beats * 6 MIDI clocks
per MIDI beat = 48 MIDI Clocks. Since there are 24 MIDI Clocks in a quarter
note, the first quarter occurs on a time of 0 MIDI Clocks, the second quarter
note occurs upon the 24th MIDI Clock, and the third quarter note occurs on the
48th MIDI Clock).

A Song Position Pointer message should not be sent while the devices are in
play. This message should only be sent while devices are stopped. Otherwise, a
slave might take too long to cue its new start point and miss a MIDI Clock that
it should be processing.

A MIDI Start always begins playback at MIDI Beat 0 (ie, the very beginning of
the song). So, when a slave receives a MIDI Start, it automatically resets its
"Song Position" to 0. If the master needs to start playback at some other point
(as set by a Song Position Pointer message), then a MIDI Continue message is
sent instead of MIDI Start. Like a MIDI Start, the MIDI Continue is immediately
followed by a MIDI Clock "downbeat" in order to start playback then. The only
difference with MIDI Continue is that this downbeat won't necessarily be the
very start of the song. The downbeat will be at whichever point the playback was
set via a Song Position Pointer message or at the point when a MIDI Stop message
was sent (whichever message last occurred). What this implies is that a slave
must always remember its "current song position" in terms of MIDI Beats. The
slave should keep track of the nearest previous MIDI beat at which it stopped
playback (ie, its stopped "Song Position"), in the anticipation that a MIDI
Continue might be received next.

Some playback devices have the capability of containing several sequences. These
are usually numbered from 0 to however many sequences there are. If 2 such
devices are synced, a musician typically may set up the sequences on each to
match the other. For example, if the master is a sequencer with a reggae bass
line for sequence 0, then a slaved drum box might have a reggae drum beat for
sequence 0. The musician can then select the same sequence number on both
devices simultaneously by having the master send a Song Select message whenever
the musician selects that sequence on the master. When a slave receives a Song
Select message, it should cue the new song at MIDI Beat 0 (ie, reset its "song
position" to 0). The master should also assume that the newly selected song will
start from beat 0. Of course, the master could send a subsequent Song Position
Pointer message (after the Song Select) to cue the slave to a different MIDI
Beat.

If a slave receives MIDI Start or MIDI Continue messages while it's in play, it
should ignore those messages. Likewise, if it receives MIDI Stop messages while
stopped, it ignores those.

Program Fragments and Example MIDI Files


Here are some of the routines to read and write variable-length numbers in MIDI
Files. These routines are in C, and use getc and putc, which read and write
single 8-bit characters from/to the files infile and outfile.

WriteVarLen (value) register long value;


{
register long buffer;
Description of MIDI files 18

buffer = value & 0x7f;


while ((value >>= 7) > 0)
{
buffer <<= 8;
buffer |= 0x80;
buffer += (value & 0x7f);
}
while (TRUE)
{
putc(buffer,outfile);
if (buffer & 0x80)
buffer >>= 8;
else
break;
}
}

doubleword ReadVarLen ()
{
register doubleword value;
register byte c;

if ((value = getc(infile)) & 0x80)


{
value &= 0x7f;
do
{
value = (value << 7) + ((c = getc(infile)) & 0x7f);
} while (c & 0x80);
}
return (value);
}

As an example, MIDI Files for the following excerpt are shown below. First, a
format 0 file is shown, with all information intermingled; then, a format 1 file
is shown with all data separated into four tracks: one for tempo and time
signature, and three for the notes. A resolution of 96 "ticks" per quarter note
is used. A time signature of 4/4 and a tempo of 120, though implied, are
explicitly stated.

Table 2. The contents of the MIDI stream represented by this example are broken
down here
Delta Time Event Code Other Bytes Comment
(decimal) (hex) (decimal)
0 FF 58 04 04 02 24 08 4 bytes: 4/4 time, 24 MIDI
clocks/click, 8 32nd notes/24 MIDI
clocks
0 FF 51 03 500000 3 bytes: 500,000 5sec per quarter-note
0 C0 5 Ch. 1, Program Change 5
0 C0 5 Ch. 1, Program Change 5
0 C1 46 Ch. 2, Program Change 46
0 C2 70 Ch. 3, Program Change 70
0 92 48 96 Ch. 3 Note On C2, forte
0 92 60 96 Ch. 3 Note On C3, forte
96 91 67 64 Ch. 2 Note On G3, mezzo-forte
Description of MIDI files 19

96 90 76 32 Ch. 1 Note On E4, piano


192 82 48 64 Ch. 3 Note Off C2, standard
0 82 60 64 Ch. 3 Note Off C3, standard
0 81 67 64 Ch. 2 Note Off G3, standard
0 80 76 64 Ch. 1 Note Off E4, standard
0 FF 2F 00 Track End

The entire format 0 MIDI file contents in hex follow. First, the header chunk:

4D 54 68 64 MThd
00 00 00 06 chunk length
00 00 format 0
00 01 one track
00 60 96 per quarter-note

Then, the track chunk. Its header, followed by the events (notice that running
status is used in places):

4D 54 72 6B MTrk
00 00 00 3B chunk length (59)

Delta-time Event Comments


00 FF 58 04 04 02 18 08 time signature
00 FF 51 03 07 A1 20 tempo
00 C0 05
00 C1 2E
00 C2 46
00 92 30 60
00 3C 60 running status
60 91 43 40
60 90 4C 20
81 40 82 30 40 two-byte delta-time
00 3C 40 running status
00 81 43 40
00 80 4C 40
00 FF 2F 00 end of track

A format 1 representation of the file is slightly different. Its header chunk:

4D 54 68 64 MThd
00 00 00 06 chunk length
00 01 format 1
00 04 four tracks
00 60 96 per quarter-note

First, the track chunk for the time signature/tempo track. Its header, followed
by the events:

4D 54 72 6B MTrk
00 00 00 14 chunk length (20)

Delta-time Event Comments


00 FF 58 04 04 02 18 08 time signature
00 FF 51 03 07 A1 20 tempo
83 00 FF 2F 00 end of track
Description of MIDI files 20

Then, the track chunk for the first music track. The MIDI convention for note
on/off running status is used in this example:

4D 54 72 6B MTrk
00 00 00 10 chunk length (16)
Description of MIDI files 21

Delta-time Event Comments


00 C0 05
81 40 90 4C 20
81 40 4C 00 Running status: note on, vel = 0
00 FF 2F 00 end of track

Then, the track chunk for the second music track:

4D 54 72 6B MTrk
00 00 00 0F chunk length (15)

Delta-time Event Comments


00 C1 2E
60 91 43 40
82 20 43 00 running status
00 FF 2F 00 end of track

Then, the track chunk for the third music track:

4D 54 72 6B MTrk
00 00 00 15 chunk length (21)

Delta-time Event Comments


00 C2 46
00 92 30 60
00 3C 60 running status
83 00 30 00 two-byte delta-time, running status
00 3C 00 running status
00 FF 2F 00 end of track

You might also like