Task preset for Azure Media Indexer

Azure Media Indexer is a Media Processor that you use to perform the following tasks: make media files and content searchable, generate closed captioning tracks and keywords, index asset files that are part of your asset.

This topic describes the task preset that you need to pass to your indexing job. For complete example, see Indexing media files with Azure Media Indexer.

Azure Media Indexer Configuration XML

The following table explains elements and attributes of the configuration XML.

Name Require Description
Input true Asset file(s) that you want to index.
Azure Media Indexer supports the following media file formats: MP4, MOV, WMV, MP3, M4A, WMA, AAC, WAV.

You can specify the file name (s) in the name or list attribute of the input element (as shown below). If you do not specify which asset file to index, the primary file is picked. If no primary asset file is set, the first file in the input asset is indexed.

To explicitly specify the asset file name, do:
<input name="TestFile.wmv" />

You can also index multiple asset files at once (up to 10 files). To do this:
- Create a text file (manifest file) and give it an .lst extension.
- Add a list of all the asset file names in your input asset to this manifest file.
- Add (upload) the manifest file to the asset.
- Specify the name of the manifest file in the input’s list attribute.
<input list="input.lst">

Note: If you add more than 10 files to the manifest file, the indexing job will fail with the 2006 error code.
metadata false Metadata for the specified asset file(s).
<metadata key="..." value="..." />

You can supply values for predefined keys.

Currently, the following keys are supported:

title and description - used to update the language model to improve speech recognition accuracy.
<metadata key="title" value="[Title of the media file]" /><metadata key="description" value="[Description of the media file]" />

username and password - used for authentication when downloading internet files via http or https.
<metadata key="username" value="[UserName]" /><metadata key="password" value="[Password]" />
The username and password values apply to all media URLs in the input manifest.
features

Added in version 1.2. Currently, the only supported feature is speech recognition ("ASR").
false The Speech Recognition feature has the following settings keys:

Language:
- The natural language to be recognized in the multimedia file.
- English, Spanish

CaptionFormats:
- a semicolon-separated list of the desired output caption formats (if any)
- ttml;webvtt


GenerateKeywords:
- A boolean flag specifying whether or not a keyword XML file is required.
- True; False.