German Monologue Scripted Speech Data

Domains: Generic

Recording environment: Quiet

Word error rate: 0.07%A measure to indicate errors in transcription, be it words omitted, inserted or wrongly transcribed

Hours 101
add to inquiry list
Domains
  • Generic
Languages
  • de-DE
Recording environment
  • Quiet
Use cases
  • Call center
  • Conversational AI
Published date
  • June 2020
Speech type
  • Monologue
Audio format
  • WAV
Sample rate
  • 16kHz
Word error rateA measure to indicate errors in transcription, be it words omitted, inserted or wrongly transcribed
  • 0.07%
Number of channels
  • 1
Bits per sample
  • 16
Communication band
  • Broadband
Recording conditions
  • Mobile
Number of speakers
  • 239
Gender distribution
  • 54W / 46M
Model Applications
  • Language Model
  • Acoustic Model
  • ASR Test & Benchmark
Age Distribution
  • Over 35: 33%
  • Under 35: 67%

Japanese Monologue Scripted Speech Data

Domains: Generic

Recording environment: Quiet

Word error rate: 0.13%A measure to indicate errors in transcription, be it words omitted, inserted or wrongly transcribed

Hours 102
add to inquiry list
Domains
  • Generic
Languages
  • jp-JP
Recording environment
  • Quiet
Published date
  • August 2020
Speech type
  • Monologue
Audio format
  • WAV
Sample rate
  • 16kHz
Word error rateA measure to indicate errors in transcription, be it words omitted, inserted or wrongly transcribed
  • 0.13%
Number of channels
  • 1
Bits per sample
  • 16
Communication band
  • Broadband
Recording conditions
  • Mobile
Number of speakers
  • 227
Gender distribution
  • 78F / 22M
Model Applications
  • Language Model
  • Acoustic Model
  • ASR Test & Benchmark
Age Distribution
  • Over 35: 53%
  • Under 35: 47%

English-Australian Monologue Scripted Speech Data

Domains: Banking, Insurance, Retail, Telco

Recording environment: Quiet

Word error rate: 0.2%A measure to indicate errors in transcription, be it words omitted, inserted or wrongly transcribed

Hours 101
add to inquiry list
Domains
  • Banking
  • Insurance
  • Retail
  • Telco
Languages
  • en-AU
Recording environment
  • Quiet
Use cases
  • Call center
  • Conversational AI
Published date
  • June 2020
Speech type
  • Monologue
Audio format
  • WAV
Sample rate
  • 16kHz
Word error rateA measure to indicate errors in transcription, be it words omitted, inserted or wrongly transcribed
  • 0.2%
Number of channels
  • 1
Bits per sample
  • 16
Communication band
  • Broadband
Recording conditions
  • Mobile
Number of speakers
  • 446
Gender distribution
  • 65F / 35M
Model Applications
  • Language Model
  • Acoustic Model
  • ASR Test & Benchmark
Age Distribution
  • Over 35: 59%
  • Under 35: 41%

English-Australian Monologue Scripted Speech Data

Domains: Banking, Insurance, Retail, Telco

Recording environment: Noisy

Word error rate: 0.2%A measure to indicate errors in transcription, be it words omitted, inserted or wrongly transcribed

Hours 105
add to inquiry list
Domains
  • Banking
  • Insurance
  • Retail
  • Telco
Languages
  • en-AU
Recording environment
  • Noisy
Use cases
  • Call center
  • Conversational AI
Published date
  • June 2020
Speech type
  • Monologue
Audio format
  • WAV
Sample rate
  • 16kHz
Word error rateA measure to indicate errors in transcription, be it words omitted, inserted or wrongly transcribed
  • 0.2%
Number of channels
  • 1
Bits per sample
  • 16
Communication band
  • Broadband
Recording conditions
  • Mobile
Number of speakers
  • 468
Gender distribution
  • 58F / 42M
Model Applications
  • Language Model
  • Acoustic Model
  • ASR Test & Benchmark
Age Distribution
  • Over 35: 58%
  • Under 35: 42%

English-Great Britain Monologue Scripted Speech Data

Domains: Banking, Insurance, Retail, Telco

Recording environment: Quiet

Word error rate: 0.2%A measure to indicate errors in transcription, be it words omitted, inserted or wrongly transcribed

Hours 121
add to inquiry list
Domains
  • Banking
  • Insurance
  • Retail
  • Telco
Languages
  • en-GB
Recording environment
  • Quiet
Use cases
  • Call center
  • Conversational AI
Published date
  • June 2020
Speech type
  • Monologue
Audio format
  • WAV
Sample rate
  • 16kHz
Word error rateA measure to indicate errors in transcription, be it words omitted, inserted or wrongly transcribed
  • 0.2%
Number of channels
  • 1
Bits per sample
  • 16
Communication band
  • Broadband
Recording conditions
  • Mobile
Number of speakers
  • 467
Gender distribution
  • 71F / 29M
Model Applications
  • Language Model
  • Acoustic Model
  • ASR Test & Benchmark
Age Distribution
  • Over 35: 67%
  • Under 35: 33%

English-Great Britain Monologue Scripted Speech Data

Domains: Banking, Insurance, Retail, Telco

Recording environment: Noisy

Word error rate: 0.2%A measure to indicate errors in transcription, be it words omitted, inserted or wrongly transcribed

Hours 81
add to inquiry list
Domains
  • Banking
  • Insurance
  • Retail
  • Telco
Languages
  • en-GB
Recording environment
  • Noisy
Use cases
  • Call center
  • Conversational AI
Published date
  • June 2020
Speech type
  • Monologue
Audio format
  • WAV
Sample rate
  • 16kHz
Word error rateA measure to indicate errors in transcription, be it words omitted, inserted or wrongly transcribed
  • 0.2%
Number of channels
  • 1
Bits per sample
  • 16
Communication band
  • Broadband
Recording conditions
  • Mobile
Number of speakers
  • 311
Gender distribution
  • 72F / 28M
Model Applications
  • Language Model
  • Acoustic Model
  • ASR Test & Benchmark
Age Distribution
  • Over 35: 69%
  • Under 35: 31%

Dutch Monologue Scripted Speech Data

Domains: Banking, Insurance, Retail, Telco

Recording environment: Quiet

Word error rate: 0.2%A measure to indicate errors in transcription, be it words omitted, inserted or wrongly transcribed

Hours 636
add to inquiry list
Domains
  • Banking
  • Insurance
  • Retail
  • Telco
Languages
  • nl-NL
Recording environment
  • Quiet
Use cases
  • Call center
  • Conversational AI
Published date
  • June 2020
Speech type
  • Monologue
Audio format
  • WAV
Sample rate
  • 16kHz
Word error rateA measure to indicate errors in transcription, be it words omitted, inserted or wrongly transcribed
  • 0.2%
Number of channels
  • 1
Bits per sample
  • 16
Communication band
  • Broadband
Recording conditions
  • Mobile
Number of speakers
  • 2112
Gender distribution
  • 24F / 76M
Model Applications
  • Language Model
  • Acoustic Model
  • ASR Test & Benchmark
Age Distribution
  • Over 35: 48%
  • Under 35: 52%

Dutch Monologue Scripted Speech Data

Domains: Banking, Insurance, Retail, Telco

Recording environment: Noisy

Word error rate: 0.2%A measure to indicate errors in transcription, be it words omitted, inserted or wrongly transcribed

Hours 536
add to inquiry list
Domains
  • Banking
  • Insurance
  • Retail
  • Telco
Languages
  • nl-NL
Recording environment
  • Noisy
Use cases
  • Call center
  • Conversational AI
Published date
  • June 2020
Speech type
  • Monologue
Audio format
  • WAV
Sample rate
  • 16kHz
Word error rateA measure to indicate errors in transcription, be it words omitted, inserted or wrongly transcribed
  • 0.2%
Number of channels
  • 1
Bits per sample
  • 16
Communication band
  • Broadband
Recording conditions
  • Mobile
Number of speakers
  • 1683
Gender distribution
  • 19F / 81M
Model Applications
  • Language Model
  • Acoustic Model
  • ASR Test & Benchmark
Age Distribution
  • Over 35: 55%
  • Under 35: 45%

Spanish-Mexican Monologue Scripted Speech Data

Domains: Generic

Recording environment: Quiet

Word error rate: 0.32%A measure to indicate errors in transcription, be it words omitted, inserted or wrongly transcribed

Hours 100
add to inquiry list
Domains
  • Generic
Languages
  • es-MX
Recording environment
  • Quiet
Use cases
  • Call center
  • Conversational AI
Published date
  • June 2020
Speech type
  • Monologue
Audio format
  • WAV
Sample rate
  • 16kHz
Word error rateA measure to indicate errors in transcription, be it words omitted, inserted or wrongly transcribed
  • 0.32%
Number of channels
  • 1
Bits per sample
  • 16
Communication band
  • Broadband
Recording conditions
  • Mobile
Number of speakers
  • 227
Gender distribution
  • 41F / 59M
Model Applications
  • Language Model
  • Acoustic Model
  • ASR Test & Benchmark
Age Distribution
  • Over 35: 31%
  • Under 35: 69%

Italian Monologue Scripted Speech Data

Domains: Banking, Insurance, Retail, Telco

Recording environment: Quiet

Word error rate: 0.5%A measure to indicate errors in transcription, be it words omitted, inserted or wrongly transcribed

Hours 626
add to inquiry list
Domains
  • Banking
  • Insurance
  • Retail
  • Telco
Languages
  • it-IT
Recording environment
  • Quiet
Use cases
  • Call center
  • Conversational AI
Published date
  • June 2020
Speech type
  • Monologue
Audio format
  • WAV
Sample rate
  • 16kHz
Word error rateA measure to indicate errors in transcription, be it words omitted, inserted or wrongly transcribed
  • 0.5%
Number of channels
  • 1
Bits per sample
  • 16
Communication band
  • Broadband
Recording conditions
  • Mobile
Number of speakers
  • 2308
Gender distribution
  • 52F / 48M
Model Applications
  • Language Model
  • Acoustic Model
  • ASR Test & Benchmark
Age Distribution
  • Over 35: 47%
  • Under 35: 53%

Italian Monologue Scripted Speech Data

Domains: Banking, Insurance, Retail, Telco

Recording environment: Noisy

Word error rate: 0.5%A measure to indicate errors in transcription, be it words omitted, inserted or wrongly transcribed

Hours 612
add to inquiry list
Domains
  • Banking
  • Insurance
  • Retail
  • Telco
Languages
  • it-IT
Recording environment
  • Noisy
Use cases
  • Call center
  • Conversational AI
Published date
  • June 2020
Speech type
  • Monologue
Audio format
  • WAV
Sample rate
  • 16kHz
Word error rateA measure to indicate errors in transcription, be it words omitted, inserted or wrongly transcribed
  • 0.5%
Number of channels
  • 1
Bits per sample
  • 16
Communication band
  • Broadband
Recording conditions
  • Mobile
Number of speakers
  • 2183
Gender distribution
  • 55F / 45M
Model Applications
  • Language Model
  • Acoustic Model
  • ASR Test & Benchmark
Age Distribution
  • Over 35: 43%
  • Under 35: 57%

Dutch Monologue Scripted Speech Data

Domains: Generic

Recording environment: Quiet

Word error rate: 0.54%A measure to indicate errors in transcription, be it words omitted, inserted or wrongly transcribed

Hours 103
add to inquiry list
Domains
  • Generic
Languages
  • nl-NL
Recording environment
  • Quiet
Use cases
  • Call center
  • Conversational AI
Published date
  • June 2020
Speech type
  • Monologue
Audio format
  • WAV
Sample rate
  • 16kHz
Word error rateA measure to indicate errors in transcription, be it words omitted, inserted or wrongly transcribed
  • 0.54%
Number of channels
  • 1
Bits per sample
  • 16
Communication band
  • Broadband
Recording conditions
  • Mobile
Number of speakers
  • 306
Gender distribution
  • 69W / 31M
Model Applications
  • Language Model
  • Acoustic Model
  • ASR Test & Benchmark
Age Distribution
  • Over 35: 61%
  • Under 35: 39%