Portuguese-Brazilian Monologue Scripted Speech Data

Domains: Banking, Insurance, Retail, Telco

Recording environment: Noisy

Word error rate: 5.2% (a measure of transcription errors: words omitted, inserted, or wrongly transcribed)

Hours: 104
Domains
  • Banking
  • Insurance
  • Retail
  • Telco
Languages
  • pt-BR
Recording environment
  • Noisy
Use cases
  • Call center
  • Conversational AI
Published date
  • June 2020
Speech type
  • Monologue
Audio format
  • WAV
Sample rate
  • 16kHz
Word error rate
  • 5.2%
Number of channels
  • 1
Bits per sample
  • 16
Communication band
  • Broadband
Recording conditions
  • Mobile
Number of speakers
  • 436
Gender distribution
  • 41F / 59M
Model Applications
  • Language Model
  • Acoustic Model
  • ASR Test & Benchmark
Age Distribution
  • Over 35: 39%
  • Under 35: 61%
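The listings describe word error rate as counting words omitted, inserted, or wrongly transcribed. The catalog does not say how the figure is computed, so the following is only a minimal sketch of the conventional definition, (substitutions + deletions + insertions) divided by the number of reference words, using a word-level edit distance. The function name `word_error_rate` and the whitespace tokenization are assumptions made for illustration.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Conventional WER: word-level edit distance / number of reference words.

    Assumption: words are separated by whitespace and compared exactly.
    """
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words seen so far
    # (excluding the current one) and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr[j] = min(
                prev[j] + 1,         # reference word omitted (deletion)
                curr[j - 1] + 1,     # extra word inserted
                prev[j - 1] + cost,  # word matched or wrongly transcribed
            )
        prev = curr
    return prev[-1] / len(ref)


if __name__ == "__main__":
    # One wrong word and one omitted word against a 4-word reference.
    print(word_error_rate("a b c d", "a x c"))
```

Under this definition a value such as 5.2% would correspond to roughly 5 errors per 100 reference words.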

Spanish-Mexican Monologue Scripted Speech Data

Domains: Banking, Insurance, Retail, Telco

Recording environment: Noisy

Word error rate: 2.0%

Hours: 103
Domains
  • Banking
  • Insurance
  • Retail
  • Telco
Languages
  • es-MX
Recording environment
  • Noisy
Use cases
  • Call center
  • Conversational AI
Published date
  • June 2020
Speech type
  • Monologue
Audio format
  • WAV
Sample rate
  • 16kHz
Word error rate
  • 2.0%
Number of channels
  • 1
Bits per sample
  • 16
Communication band
  • Broadband
Recording conditions
  • Mobile
Number of speakers
  • 331
Gender distribution
  • 37F / 63M
Model Applications
  • Language Model
  • Acoustic Model
  • ASR Test & Benchmark
Age Distribution
  • Over 35: 41%
  • Under 35: 59%
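Each listing specifies WAV audio at 16 kHz, 1 channel, and 16 bits per sample. As a hedged sketch of how a delivered file could be checked against those values, the snippet below reads the WAV header with Python's standard `wave` module. The helper names `read_wav_params` and `matches_listing` and the `EXPECTED` dictionary are hypothetical, and the catalog does not describe the actual file layout of the datasets.

```python
import wave

# Values taken from the listings: mono, 16 bits (2 bytes) per sample, 16 kHz.
EXPECTED = {"channels": 1, "sample_width_bytes": 2, "sample_rate_hz": 16000}


def read_wav_params(path: str) -> dict:
    """Read channel count, sample width, and sample rate from a WAV header."""
    with wave.open(path, "rb") as wav:
        return {
            "channels": wav.getnchannels(),
            "sample_width_bytes": wav.getsampwidth(),
            "sample_rate_hz": wav.getframerate(),
        }


def matches_listing(path: str) -> bool:
    """Return True when the file's header matches the listed audio format."""
    return read_wav_params(path) == EXPECTED
```

Calling `matches_listing("sample.wav")` on a file of your own (the file name here is a placeholder) returns `False` for, say, an 8 kHz or stereo recording.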

English-Great Britain Monologue Scripted Speech Data

Domains: Generic

Recording environment: Quiet

Word error rate: 2.78%

Hours: 124
Domains
  • Generic
Languages
  • en-GB
Recording environment
  • Quiet
Use cases
  • Call center
  • Conversational AI
Published date
  • June 2020
Speech type
  • Monologue
Audio format
  • WAV
Sample rate
  • 16kHz
Word error rate
  • 2.78%
Number of channels
  • 1
Bits per sample
  • 16
Communication band
  • Broadband
Recording conditions
  • Mobile
Number of speakers
  • 276
Gender distribution
  • 68F / 32M
Model Applications
  • Language Model
  • Acoustic Model
  • ASR Test & Benchmark
Age Distribution
  • Over 35: 67%
  • Under 35: 33%

French-Canadian Monologue Scripted Speech Data

Domains: Banking, Insurance, Retail, Telco

Recording environment: Quiet

Word error rate: 1.4%

Hours: 100
Domains
  • Banking
  • Insurance
  • Retail
  • Telco
Languages
  • fr-CA
Recording environment
  • Quiet
Use cases
  • Call center
  • Conversational AI
Published date
  • June 2020
Speech type
  • Monologue
Audio format
  • WAV
Sample rate
  • 16kHz
Word error rate
  • 1.4%
Number of channels
  • 1
Bits per sample
  • 16
Communication band
  • Broadband
Recording conditions
  • Mobile
Number of speakers
  • 379
Gender distribution
  • 55F / 45M
Model Applications
  • Language Model
  • Acoustic Model
  • ASR Test & Benchmark
Age Distribution
  • Over 35: 63%
  • Under 35: 37%

Italian Monologue Scripted Speech Data

Domains: Generic

Recording environment: Quiet

Word error rate: 2.4%

Hours: 100
Domains
  • Generic
Languages
  • it-IT
Recording environment
  • Quiet
Use cases
  • Call center
  • Conversational AI
Published date
  • June 2020
Speech type
  • Monologue
Audio format
  • WAV
Sample rate
  • 16kHz
Word error rate
  • 2.4%
Number of channels
  • 1
Bits per sample
  • 16
Communication band
  • Broadband
Recording conditions
  • Mobile
Number of speakers
  • 300
Gender distribution
  • 57F / 43M
Model Applications
  • Language Model
  • Acoustic Model
  • ASR Test & Benchmark
Age Distribution
  • Over 35: 41%
  • Under 35: 59%

German Monologue Scripted Speech Data

Domains: Banking, Insurance, Retail, Telco

Recording environment: Quiet

Word error rate: 0.6%

Hours: 102
Domains
  • Banking
  • Insurance
  • Retail
  • Telco
Languages
  • de-DE
Recording environment
  • Quiet
Use cases
  • Call center
  • Conversational AI
Published date
  • June 2020
Speech type
  • Monologue
Audio format
  • WAV
Sample rate
  • 16kHz
Word error rate
  • 0.6%
Number of channels
  • 1
Bits per sample
  • 16
Communication band
  • Broadband
Recording conditions
  • Mobile
Number of speakers
  • 463
Gender distribution
  • 49F / 51M
Model Applications
  • Language Model
  • Acoustic Model
  • ASR Test & Benchmark
Age Distribution
  • Over 35: 29%
  • Under 35: 71%

French-Canadian Monologue Scripted Speech Data

Domains: Banking, Insurance, Retail, Telco

Recording environment: Noisy

Word error rate: 1.4%

Hours: 100
Domains
  • Banking
  • Insurance
  • Retail
  • Telco
Languages
  • fr-CA
Recording environment
  • Noisy
Use cases
  • Call center
  • Conversational AI
Published date
  • June 2020
Speech type
  • Monologue
Audio format
  • WAV
Sample rate
  • 16kHz
Word error rate
  • 1.4%
Number of channels
  • 1
Bits per sample
  • 16
Communication band
  • Broadband
Recording conditions
  • Mobile
Number of speakers
  • 351
Gender distribution
  • 61F / 39M
Model Applications
  • Language Model
  • Acoustic Model
  • ASR Test & Benchmark
Age Distribution
  • Over 35: 61%
  • Under 35: 39%

Portuguese-Brazilian Monologue Scripted Speech Data

Domains: Generic

Recording environment: Quiet

Word error rate: 0.76%

Hours: 152
Domains
  • Generic
Languages
  • pt-BR
Recording environment
  • Quiet
Use cases
  • Call center
  • Conversational AI
Published date
  • June 2020
Speech type
  • Monologue
Audio format
  • WAV
Sample rate
  • 16kHz
Word error rate
  • 0.76%
Number of channels
  • 1
Bits per sample
  • 16
Communication band
  • Broadband
Recording conditions
  • Mobile
Number of speakers
  • 334
Gender distribution
  • 29F / 71M
Model Applications
  • Language Model
  • Acoustic Model
  • ASR Test & Benchmark
Age Distribution
  • Over 35: 26%
  • Under 35: 74%

German Monologue Scripted Speech Data

Domains: Banking, Insurance, Retail, Telco

Recording environment: Noisy

Word error rate: 0.6%

Hours: 100
Domains
  • Banking
  • Insurance
  • Retail
  • Telco
Languages
  • de-DE
Recording environment
  • Noisy
Use cases
  • Call center
  • Conversational AI
Published date
  • June 2020
Speech type
  • Monologue
Audio format
  • WAV
Sample rate
  • 16kHz
Word error rate
  • 0.6%
Number of channels
  • 1
Bits per sample
  • 16
Communication band
  • Broadband
Recording conditions
  • Mobile
Number of speakers
  • 477
Gender distribution
  • 51F / 49M
Model Applications
  • Language Model
  • Acoustic Model
  • ASR Test & Benchmark
Age Distribution
  • Over 35: 39%
  • Under 35: 61%

French-France Monologue Scripted Speech Data

Domains: Banking, Insurance, Retail, Telco

Recording environment: Quiet

Word error rate: 1.2%

Hours: 116
Domains
  • Banking
  • Insurance
  • Retail
  • Telco
Languages
  • fr-FR
Recording environment
  • Quiet
Use cases
  • Call center
  • Conversational AI
Published date
  • June 2020
Speech type
  • Monologue
Audio format
  • WAV
Sample rate
  • 16kHz
Word error rate
  • 1.2%
Number of channels
  • 1
Bits per sample
  • 16
Communication band
  • Broadband
Recording conditions
  • Mobile
Number of speakers
  • 449
Gender distribution
  • 45F / 55M
Model Applications
  • Language Model
  • Acoustic Model
  • ASR Test & Benchmark
Age Distribution
  • Over 35: 39%
  • Under 35: 61%

German Monologue Scripted Speech Data

Domains: Generic

Recording environment: Quiet

Word error rate: 0.07%

Hours: 101
Domains
  • Generic
Languages
  • de-DE
Recording environment
  • Quiet
Use cases
  • Call center
  • Conversational AI
Published date
  • June 2020
Speech type
  • Monologue
Audio format
  • WAV
Sample rate
  • 16kHz
Word error rate
  • 0.07%
Number of channels
  • 1
Bits per sample
  • 16
Communication band
  • Broadband
Recording conditions
  • Mobile
Number of speakers
  • 239
Gender distribution
  • 54F / 46M
Model Applications
  • Language Model
  • Acoustic Model
  • ASR Test & Benchmark
Age Distribution
  • Over 35: 33%
  • Under 35: 67%

English-Australian Monologue Scripted Speech Data

Domains: Banking, Insurance, Retail, Telco

Recording environment: Quiet

Word error rate: 0.2%

Hours: 101
Domains
  • Banking
  • Insurance
  • Retail
  • Telco
Languages
  • en-AU
Recording environment
  • Quiet
Use cases
  • Call center
  • Conversational AI
Published date
  • June 2020
Speech type
  • Monologue
Audio format
  • WAV
Sample rate
  • 16kHz
Word error rate
  • 0.2%
Number of channels
  • 1
Bits per sample
  • 16
Communication band
  • Broadband
Recording conditions
  • Mobile
Number of speakers
  • 446
Gender distribution
  • 65F / 35M
Model Applications
  • Language Model
  • Acoustic Model
  • ASR Test & Benchmark
Age Distribution
  • Over 35: 59%
  • Under 35: 41%
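The listings give hours of audio together with sample rate, bits per sample, and channel count, which is enough for a rough storage estimate. The sketch below assumes uncompressed PCM and ignores WAV header overhead; the catalog itself does not state file sizes, and the function name `pcm_size_bytes` is hypothetical.

```python
def pcm_size_bytes(hours: float,
                   sample_rate_hz: int = 16000,
                   bits_per_sample: int = 16,
                   channels: int = 1) -> int:
    """Approximate size of uncompressed PCM audio, excluding file headers."""
    seconds = hours * 3600
    bytes_per_sample = bits_per_sample // 8
    return int(seconds * sample_rate_hz * bytes_per_sample * channels)


if __name__ == "__main__":
    # 104 hours at 16 kHz, 16-bit, mono.
    size = pcm_size_bytes(104)
    print(f"{size} bytes, about {size / 1e9:.2f} GB")
```

At these settings one hour of audio comes to 115,200,000 bytes, so a 104-hour dataset would be about 11.98 GB under the stated assumptions.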