German Monologue Scripted Speech Data

Domains: Generic

Recording environment: Quiet

Word error rate: 0.07%A measurement indicating errors in alignment of text representation (actual vs. perfect) of audio, taking into account words omitted, inserted or wrongly replaced

Hours 101
add to inquiry list
Speech Type
  • Scripted
Domains
  • Generic
Languages
  • de-DE
Recording Environment
  • Quiet
Published Date
  • June 2020
Audio Format
  • WAV
Sample Rate
  • 16kHz
Word Error RateA measurement indicating errors in alignment of text representation (actual vs. perfect) of audio, taking into account words omitted, inserted or wrongly replaced
  • 0.07%
Number of Channels
  • 1
Bits per Sample
  • 16
Communication Band
  • Broadband
Recording Conditions
  • Mobile
Number of Speakers
  • 239
Gender Distribution
  • 54W / 46M
Age Distribution
  • Over 35: 33%
  • Under 35: 67%
Speaker Metadata
  • Yes

Japanese Monologue Scripted Speech Data

Domains: Generic

Recording environment: Quiet

Word error rate: 0.13%A measurement indicating errors in alignment of text representation (actual vs. perfect) of audio, taking into account words omitted, inserted or wrongly replaced

Hours 102
add to inquiry list
Speech Type
  • Scripted
Domains
  • Generic
Languages
  • jp-JP
Recording Environment
  • Quiet
Published Date
  • August 2020
Audio Format
  • WAV
Sample Rate
  • 16kHz
Word Error RateA measurement indicating errors in alignment of text representation (actual vs. perfect) of audio, taking into account words omitted, inserted or wrongly replaced
  • 0.13%
Number of Channels
  • 1
Bits per Sample
  • 16
Communication Band
  • Broadband
Recording Conditions
  • Mobile
Number of Speakers
  • 227
Gender Distribution
  • 78F / 22M
Age Distribution
  • Over 35: 53%
  • Under 35: 47%
Speaker Metadata
  • Yes

English-Australian Monologue Scripted Speech Data

Domains: Banking, Insurance, Retail, Telco

Recording environment: Quiet

Word error rate: 0.2%A measurement indicating errors in alignment of text representation (actual vs. perfect) of audio, taking into account words omitted, inserted or wrongly replaced

Hours 101
add to inquiry list
Speech Type
  • Scripted
Domains
  • Banking
  • Insurance
  • Retail
  • Telco
Languages
  • en-AU
Recording Environment
  • Quiet
Published Date
  • June 2020
Audio Format
  • WAV
Sample Rate
  • 16kHz
Word Error RateA measurement indicating errors in alignment of text representation (actual vs. perfect) of audio, taking into account words omitted, inserted or wrongly replaced
  • 0.2%
Number of Channels
  • 1
Bits per Sample
  • 16
Communication Band
  • Broadband
Recording Conditions
  • Mobile
Number of Speakers
  • 446
Gender Distribution
  • 65F / 35M
Age Distribution
  • Over 35: 59%
  • Under 35: 41%
Speaker Metadata
  • Yes

English-Australian Monologue Scripted Speech Data

Domains: Banking, Insurance, Retail, Telco

Recording environment: Noisy

Word error rate: 0.2%A measurement indicating errors in alignment of text representation (actual vs. perfect) of audio, taking into account words omitted, inserted or wrongly replaced

Hours 105
add to inquiry list
Speech Type
  • Scripted
Domains
  • Banking
  • Insurance
  • Retail
  • Telco
Languages
  • en-AU
Recording Environment
  • Noisy
Published Date
  • June 2020
Audio Format
  • WAV
Sample Rate
  • 16kHz
Word Error RateA measurement indicating errors in alignment of text representation (actual vs. perfect) of audio, taking into account words omitted, inserted or wrongly replaced
  • 0.2%
Number of Channels
  • 1
Bits per Sample
  • 16
Communication Band
  • Broadband
Recording Conditions
  • Mobile
Number of Speakers
  • 468
Gender Distribution
  • 58F / 42M
Age Distribution
  • Over 35: 58%
  • Under 35: 42%
Speaker Metadata
  • Yes

English-Great Britain Monologue Scripted Speech Data

Domains: Banking, Insurance, Retail, Telco

Recording environment: Quiet

Word error rate: 0.2%A measurement indicating errors in alignment of text representation (actual vs. perfect) of audio, taking into account words omitted, inserted or wrongly replaced

Hours 121
add to inquiry list
Speech Type
  • Scripted
Domains
  • Banking
  • Insurance
  • Retail
  • Telco
Languages
  • en-GB
Recording Environment
  • Quiet
Published Date
  • June 2020
Audio Format
  • WAV
Sample Rate
  • 16kHz
Word Error RateA measurement indicating errors in alignment of text representation (actual vs. perfect) of audio, taking into account words omitted, inserted or wrongly replaced
  • 0.2%
Number of Channels
  • 1
Bits per Sample
  • 16
Communication Band
  • Broadband
Recording Conditions
  • Mobile
Number of Speakers
  • 467
Gender Distribution
  • 71F / 29M
Age Distribution
  • Over 35: 67%
  • Under 35: 33%
Speaker Metadata
  • Yes

English-Great Britain Monologue Scripted Speech Data

Domains: Banking, Insurance, Retail, Telco

Recording environment: Noisy

Word error rate: 0.2%A measurement indicating errors in alignment of text representation (actual vs. perfect) of audio, taking into account words omitted, inserted or wrongly replaced

Hours 81
add to inquiry list
Speech Type
  • Scripted
Domains
  • Banking
  • Insurance
  • Retail
  • Telco
Languages
  • en-GB
Recording Environment
  • Noisy
Published Date
  • June 2020
Audio Format
  • WAV
Sample Rate
  • 16kHz
Word Error RateA measurement indicating errors in alignment of text representation (actual vs. perfect) of audio, taking into account words omitted, inserted or wrongly replaced
  • 0.2%
Number of Channels
  • 1
Bits per Sample
  • 16
Communication Band
  • Broadband
Recording Conditions
  • Mobile
Number of Speakers
  • 311
Gender Distribution
  • 72F / 28M
Age Distribution
  • Over 35: 69%
  • Under 35: 31%
Speaker Metadata
  • Yes

Dutch Monologue Scripted Speech Data

Domains: Banking, Insurance, Retail, Telco

Recording environment: Quiet

Word error rate: 0.2%A measurement indicating errors in alignment of text representation (actual vs. perfect) of audio, taking into account words omitted, inserted or wrongly replaced

Hours 636
add to inquiry list
Speech Type
  • Scripted
Domains
  • Banking
  • Insurance
  • Retail
  • Telco
Languages
  • nl-NL
Recording Environment
  • Quiet
Published Date
  • June 2020
Audio Format
  • WAV
Sample Rate
  • 16kHz
Word Error RateA measurement indicating errors in alignment of text representation (actual vs. perfect) of audio, taking into account words omitted, inserted or wrongly replaced
  • 0.2%
Number of Channels
  • 1
Bits per Sample
  • 16
Communication Band
  • Broadband
Recording Conditions
  • Mobile
Number of Speakers
  • 2112
Gender Distribution
  • 24F / 76M
Age Distribution
  • Over 35: 48%
  • Under 35: 52%
Speaker Metadata
  • Yes

Dutch Monologue Scripted Speech Data

Domains: Banking, Insurance, Retail, Telco

Recording environment: Noisy

Word error rate: 0.2%A measurement indicating errors in alignment of text representation (actual vs. perfect) of audio, taking into account words omitted, inserted or wrongly replaced

Hours 536
add to inquiry list
Speech Type
  • Scripted
Domains
  • Banking
  • Insurance
  • Retail
  • Telco
Languages
  • nl-NL
Recording Environment
  • Noisy
Published Date
  • June 2020
Audio Format
  • WAV
Sample Rate
  • 16kHz
Word Error RateA measurement indicating errors in alignment of text representation (actual vs. perfect) of audio, taking into account words omitted, inserted or wrongly replaced
  • 0.2%
Number of Channels
  • 1
Bits per Sample
  • 16
Communication Band
  • Broadband
Recording Conditions
  • Mobile
Number of Speakers
  • 1683
Gender Distribution
  • 19F / 81M
Age Distribution
  • Over 35: 55%
  • Under 35: 45%
Speaker Metadata
  • Yes

Spanish-Mexican Monologue Scripted Speech Data

Domains: Generic

Recording environment: Quiet

Word error rate: 0.32%A measurement indicating errors in alignment of text representation (actual vs. perfect) of audio, taking into account words omitted, inserted or wrongly replaced

Hours 100
add to inquiry list
Speech Type
  • Scripted
Domains
  • Generic
Languages
  • es-MX
Recording Environment
  • Quiet
Published Date
  • June 2020
Audio Format
  • WAV
Sample Rate
  • 16kHz
Word Error RateA measurement indicating errors in alignment of text representation (actual vs. perfect) of audio, taking into account words omitted, inserted or wrongly replaced
  • 0.32%
Number of Channels
  • 1
Bits per Sample
  • 16
Communication Band
  • Broadband
Recording Conditions
  • Mobile
Number of Speakers
  • 227
Gender Distribution
  • 41F / 59M
Age Distribution
  • Over 35: 31%
  • Under 35: 69%
Speaker Metadata
  • Yes

Italian Monologue Scripted Speech Data

Domains: Banking, Insurance, Retail, Telco

Recording environment: Quiet

Word error rate: 0.5%A measurement indicating errors in alignment of text representation (actual vs. perfect) of audio, taking into account words omitted, inserted or wrongly replaced

Hours 626
add to inquiry list
Speech Type
  • Scripted
Domains
  • Banking
  • Insurance
  • Retail
  • Telco
Languages
  • it-IT
Recording Environment
  • Quiet
Published Date
  • June 2020
Audio Format
  • WAV
Sample Rate
  • 16kHz
Word Error RateA measurement indicating errors in alignment of text representation (actual vs. perfect) of audio, taking into account words omitted, inserted or wrongly replaced
  • 0.5%
Number of Channels
  • 1
Bits per Sample
  • 16
Communication Band
  • Broadband
Recording Conditions
  • Mobile
Number of Speakers
  • 2308
Gender Distribution
  • 52F / 48M
Age Distribution
  • Over 35: 47%
  • Under 35: 53%
Speaker Metadata
  • Yes

Italian Monologue Scripted Speech Data

Domains: Banking, Insurance, Retail, Telco

Recording environment: Noisy

Word error rate: 0.5%A measurement indicating errors in alignment of text representation (actual vs. perfect) of audio, taking into account words omitted, inserted or wrongly replaced

Hours 612
add to inquiry list
Speech Type
  • Scripted
Domains
  • Banking
  • Insurance
  • Retail
  • Telco
Languages
  • it-IT
Recording Environment
  • Noisy
Published Date
  • June 2020
Audio Format
  • WAV
Sample Rate
  • 16kHz
Word Error RateA measurement indicating errors in alignment of text representation (actual vs. perfect) of audio, taking into account words omitted, inserted or wrongly replaced
  • 0.5%
Number of Channels
  • 1
Bits per Sample
  • 16
Communication Band
  • Broadband
Recording Conditions
  • Mobile
Number of Speakers
  • 2183
Gender Distribution
  • 55F / 45M
Age Distribution
  • Over 35: 43%
  • Under 35: 57%
Speaker Metadata
  • Yes

Dutch Monologue Scripted Speech Data

Domains: Generic

Recording environment: Quiet

Word error rate: 0.54%A measurement indicating errors in alignment of text representation (actual vs. perfect) of audio, taking into account words omitted, inserted or wrongly replaced

Hours 103
add to inquiry list
Speech Type
  • Scripted
Domains
  • Generic
Languages
  • nl-NL
Recording Environment
  • Quiet
Published Date
  • June 2020
Audio Format
  • WAV
Sample Rate
  • 16kHz
Word Error RateA measurement indicating errors in alignment of text representation (actual vs. perfect) of audio, taking into account words omitted, inserted or wrongly replaced
  • 0.54%
Number of Channels
  • 1
Bits per Sample
  • 16
Communication Band
  • Broadband
Recording Conditions
  • Mobile
Number of Speakers
  • 306
Gender Distribution
  • 69W / 31M
Age Distribution
  • Over 35: 61%
  • Under 35: 39%
Speaker Metadata
  • Yes