Skip to main content

Audio

NameTypeDescription
include_if_source_hasarray                          This array allows for conditionally outputting tracks based on whether or not a specific input track exists. The tracks in the source are referred to by number reference: audio[0] refers to the first audio track.
include_conditionsarray                          This array allows for conditionally including output audio tracks based on conditions in the input file.
verifybooleanEnable or disable post transcode verification for this track.
default: true
codecenum                          
copy
aac
mpeg2_aac
mpeg4_aac
aac_lc
heaac_v1
heaac_v2
heaac_auto
s302m
mp2
pcm
mp3
ac3
aiff
alac
flac
eac3
vorbis
opus
dolby_digital
dolby_digital_plus
The audio codec to use. Selecting 'copy' will attempt to use the compressed source audio stream.
codec_providerenum                          
default
ffmpeg
mainconcept_v10
mainconcept_v11
mainconcept_v13
The codec provider to be used for encoding.
pidinteger
The audio program ID. This is only used for MPEG transport streams.
maximum: 8190
channelsinteger
The number of audio channels.
minimum: 1
maximum: 24
channel_orderenum                          
smpte
wave_extensible
Specify a way to assign PCM channels to predefined speaker positions. Supported only for WAVE output format.
default: smpte
dolby_digital_plusobjectThe parameters for Dolby Digital Plus encoding.
sample_sizeenum                          
8
16
24
32
64
The audio sample size in bits.
default: 24
sample_formatenum                          
pcm_s8
pcm_u8
pcm_f16le
pcm_f24le
pcm_f32le
pcm_f64le
pcm_f16be
pcm_f24be
pcm_f32be
pcm_f64be
pcm_s16le
pcm_s24le
pcm_s32le
pcm_s64le
pcm_s16be
pcm_s24be
pcm_s32be
pcm_s64be
pcm_u16le
pcm_u24le
pcm_u32le
pcm_u64le
pcm_u16be
pcm_u24be
pcm_u32be
pcm_u64be
The audio sample format/description.
sample_rateinteger
The audio sample rate in Hz. Typical values are 44100 and 48000. Omit to use the source sample rate.
bitrate_modeenum                          
cbr
vbr
Select between constant and variable bitrate encoding. Note that not all codecs support all bitrate modes. Omit this value to use the codec's default.
bitrate_kbnumber
The audio bitrate in kilobits per second. This is the average bitrate in the case of vbr. Not all audio codecs support this setting. Omit to use codec's default.
minimum: 1
maximum: 1024
min_bitrate_kbnumber
The minimum audio bitrate in kilobits per second. Valid for vbr only.
minimum: 1
maximum: 1024
max_bitrate_kbnumber
The maximum audio bitrate in kilobits per second. Valid for vbr only.
minimum: 1
maximum: 1024
languagestring
The audio language code. ISO-639 notation is preferred, but Hybrik will attempt to convert the passed language identifier.
default_languagestring
The default audio language code. It is used when 'language' is not set and source language cannot be converted to valid ISO-639 notation. ISO-639 notation is preferred, but Hybrik will attempt to convert the passed language identifier.
dispositionenum                          
default
dub
original
comment
lyrics
karaoke
audio_description
spoken_subtitles
clean_audio
The audio disposition.
track_namestring
The name of this audio track - will be used for mov files and MPEG-DASH (representation::id) for example. May be ignored, depending on your container format.
track_group_idstringThis indicates which Group this track belongs to. Multiple tracks with the same content but different bitrates would have the same track_group_id.
layer_idstringThis indicates which Layer this tracks belongs to. For example, this allows bundling one video layer and multiple audio layers with same bitrates but different languages.
layer_affinitiesarray                          This indicates which other layers this layer can be combined with. For example, to combine audio and video layers.
filtersarrayAn array of audio filters that will be applied in order to the output audio.
channel_designatorsarray enum                          
unknown
left
right
front_left
front_right
front_center
back_left
back_right
front_left_of_center
front_right_of_center
back_center
side_left
side_right
left_height
right_height
center
lfe_screen
left_surround
right_surround
left_center
right_center
center_surround
left_surround_direct
right_surround_direct
top_center_surround
vertical_height_left
vertical_height_center
vertical_height_right
top_back_left
top_back_center
top_back_right
top_front_left
top_front_center
top_front_right
rear_surround_left
rear_surround_right
left_wide
right_wide
lfe2
left_total
right_total
hearing_impaired
narration
mono
dialog_centric_mix
center_surround_direct
haptic
headphones_left
headphones_right
click_track
foreign_language
discrete
discrete_0
discrete_1
discrete_2
discrete_3
discrete_4
discrete_5
discrete_6
discrete_7
discrete_8
discrete_9
discrete_10
discrete_11
discrete_12
discrete_13
discrete_14
discrete_15
*****
convert_aac_headersenum                          
adts_to_asc
For solving aac transmux issues between mp4 and ts/raw tracks.
aac_header_intervalintegerAllowing to solve specific hardware playback compliance problems.
dialnormnumber
string
Dialogue Level (aka dialogue normalization or dialnorm) is the average dialogue level of a program over time, measured with an LAEq meter, referenced to 0 dBFS.
mainconcept_stream_mux_optionsstring
Provide direct stream instruction to the MainConcept multiplexer. Values are constructed as "prop=val,prop=val". See MainConcept documentation for valid values.
mainconcept_audio_optionsstring
MainConcept specific codec options - please reference the mainconcept codec documentation.
mainconcept_audio_profileenum                          
MPEG1
MPEG2
DVB
DVHS
VCD
SVCD
DVD
DVD_MPEG1
DVD_DVR
DVD_DVR_MPEG1
MMV
HDV_HD1
HDV_HD2
One of the preset values for profile (e.g. mpeg1).
pcm_wrappingenum                          
raw
bwf
aes
The type of wrapping to use for PCM audio tracks.
metadataobject*****
ffmpeg_argsstring
The FFmpeg (target) command line arguments to be used. Note that these will override competing settings in the JSON.
dolbyeobjectThe parameters for Dolby E encoding.

Example audio

{
"uid": "transcode_task",
"kind": "transcode",
"payload": {
"location": {
"storage_provider": "s3",
"path": "s3://my_bucket/my_folder"
},
"targets": [
{
"file_pattern": "{source_basename}_output{default_extension}",
"existing_files": "replace",
"container": {
"kind": "mpegts"
},
"video": {
"width": 1280,
"height": 720,
"codec": "h264"
},
"audio": [
{
"codec": "ac3",
"pid": 482,
"channels": 6,
"sample_rate": 48000,
"bitrate_kb": 384
},
{
"codec": "aac_lc",
"pid": 483,
"channels": 2,
"sample_rate": 48000,
"bitrate_kb": 128
}
]
}
]
}
}