Auxiliary Information File
The evaluation specification declares certain side information to be available
to the automatic systems. This information is contained in the auxiliary inforamtion
file aux_info.ndx
| Source Condition | Source Language | Index Filename |
|---|---|---|
| BNews ASR Transcripts | English | seg_SR=bnasr_TE=eng,nat.ndx |
| Mandarin | seg_SR=bnasr_TE=man,nat.ndx | |
| BNews Manual Transcripts | English | seg_SR=bnman_TE=eng,nat.ndx |
| Mandarin | seg_SR=bnman_TE=man,nat.ndx |
| Source Condition | Source Language | Content Language | Index Filename |
|---|---|---|---|
| NWT + BNews ASR Trans. | Multilingual | Native | det_SR=nwt+bnasr_TE=mul,nat.ndx |
| English | det_SR=nwt+bnasr_TE=mul,eng.ndx | ||
| Mandarin | Native | det_SR=nwt+bnasr_TE=man,nat.ndx | |
| English | det_SR=nwt+bnasr_TE=man,eng.ndx | ||
| English | Native | det_SR=nwt+bnasr_TE=eng,nat.ndx | |
| English | Not Defined by the Eval. Spec. | ||
| NWT + BNews Manual Trans. | Multilingual | Native | det_SR=nwt+bnman_TE=mul,nat.ndx |
| English | det_SR=nwt+bnman_TE=mul,eng.ndx | ||
| Mandarin | Native | det_SR=nwt+bnman_TE=man,nat.ndx | |
| English | det_SR=nwt+bnman_TE=man,eng.ndx | ||
| English | Native | det_SR=nwt+bnman_TE=eng,nat.ndx | |
| English | Not Defined by the Eval. Spec. |
| Source Language | Content Language | Source Condition | Index/Key Filenames |
|---|---|---|---|
| Multilingual | Native | NWT + BNews ASR Trans. | ./lnk_SR=nwt+bnasr_TE=mul,nat.ndx ./lnk_SR=nwt+bnasr_TE=mul,nat.key |
| NWT + BNews Manual Trans. | ./lnk_SR=nwt+bnman_TE=mul,nat.ndx ./lnk_SR=nwt+bnman_TE=mul,nat.key | ||
| English | NWT + BNews ASR Trans. | ./lnk_SR=nwt+bnasr_TE=mul,eng.ndx ./lnk_SR=nwt+bnasr_TE=mul,eng.key | |
| NWT + BNews Manual Trans. | ./lnk_SR=nwt+bnman_TE=mul,eng.ndx ./lnk_SR=nwt+bnman_TE=mul,eng.key |
First Story Detection Index Files
| Source Condition | Source Language | Content Language | Index Filename |
|---|---|---|---|
| NWT + BNews ASR Trans. | English | Native | fsd_SR=nwt+bnasr_TE=eng,nat.ndx |
| NWT + BNews Manual Trans. | English | Native | fsd_SR=nwt+bnman_TE=eng,nat.ndx |
Tracking Index Files
There is only one test language condition for the tracking evaluation, which is
multilingual tracking. The variations are on broadcast source, test content
language, and training story source language.
For each evalution test and training condition, there is a
individual index file for each test topic. Due to the large number of index files, all
tracking Index files are stored in a single directory, 'trk_ndx',
and experiment control files
identify which topic index files consitute an evalulation. (Note this is a new format
as of August, 2000).
Subset Definitions Files
For the tracking and detection tasks, the evaluation conditions involve pooling source texts
from languages. The subset definition files below provide a way
to compute performance statistics on multiple, independent 'subsets' of an evaluation run.
The 'standard' divisions are to divide the data by source texts, Newswire and Broadcast News,
and by the test source language, English or Mandarin.
Currently, only the tracking and detection evaluation scripts support the subset definition file. To use a subset definition file, add the command line argument '-U SubsetFile' to the tracking evaluation script 'TDT3trk.pl', or for the detection evaluation script 'TDT3det.pl', add the command line option '-S SubsetFile'.
| Source Condition | Test Source Language | Test Content Language | Sourcefile Subset Definition Filename |
|---|---|---|---|
| NWT + BNews ASR Trans. or NWT + BNews Manual Trans. | Multilingual | Native or English | Subsets_TE=mul.ssd |
| NWT + BNews ASR Trans. or NWT + BNews Manual Trans. | Mandarin | Native or English | Subsets_TE=man.ssd |
| NWT + BNews ASR Trans. or NWT + BNews Manual Trans. | English | Native or English | Subsets_TE=eng.ssd |