Automatic document reader with speech output capabilities

MUMOLO, ENZO
1991-01-01

Abstract

The paper describes a reliable system that reads typewritten documents and converts them into speech output. The system is built around a commercial optical scanner connected to a personal computer equipped with an add-on DSP card; this card is the heart of the system's speech capability. The board, developed entirely at the Alcatel Face research centre, supports several speech processing features, such as connected-word speech recognition, speech compression, and Italian text-to-speech conversion. The text-to-speech conversion, also developed in our laboratory, is based on segment concatenation, where the basic segments are diphones and triphones. The current size of the segment database is about 400 KB. Male and female voices are both available through two separate sets of segments, and the voice type can be changed in real time if required.
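
As an illustration of the segment concatenation idea mentioned in the abstract, the following minimal sketch covers a phoneme sequence with triphone units where an inventory provides one and falls back to diphones elsewhere. The abstract does not describe how the system selects units, so the greedy rule, the function name `to_units`, the `-` separator, the `_` silence marker, and the phoneme symbols are all assumptions made for illustration only.

```python
from typing import List, Set


def to_units(phonemes: List[str], triphones: Set[str]) -> List[str]:
    """Cover a phoneme sequence with concatenation units.

    Hypothetical greedy rule (not taken from the paper): use a triphone
    when the inventory contains it, otherwise use a diphone.
    """
    # "_" is an assumed marker for silence at the start and end of the word.
    seq = ["_"] + phonemes + ["_"]
    units: List[str] = []
    i = 0
    while i < len(seq) - 1:
        if i + 2 < len(seq):
            tri = "-".join(seq[i:i + 3])
            if tri in triphones:
                # A triphone spans two phoneme transitions, so skip two steps.
                units.append(tri)
                i += 2
                continue
        # A diphone spans a single transition between adjacent phonemes.
        units.append(seq[i] + "-" + seq[i + 1])
        i += 1
    return units


# Example with the Italian word "casa", using assumed phoneme symbols.
diphones_only = to_units(["k", "a", "s", "a"], set())
with_triphone = to_units(["k", "a", "s", "a"], {"k-a-s"})
```

In a real synthesizer each unit name would index a stored waveform or parameter segment, and switching between male and female voices would amount to selecting a different segment set for the same unit sequence.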


Use this identifier to cite or link to this document: https://hdl.handle.net/11368/2798324