The paper describes a reliable system capable to read typewritten documents and to convert them into a speech output. The basic architecture of the system is built around a commercial optical scanner connected to a Personal Computer provided with an add-on DSP card; this card represents the hearth of the speech capability of the entire system. The board, entirely developed at the research centre of Alcatel Face, allows several speech processing features, as connected words speech recognition, speech compression and Italian text to speech conversion. The text-to-speech conversion, also developed in our laboratory, is based on the segment concatenation approach where the basic segments are diphones and triphones; the current size of the segment database is in the range of 400KBytes; both male and female voices are available by means of two separate sets of segments and if required the type of voice can be changed in real-time.

Automatic document reader witj speech output capabilities

MUMOLO, ENZO
1991

Abstract

The paper describes a reliable system capable to read typewritten documents and to convert them into a speech output. The basic architecture of the system is built around a commercial optical scanner connected to a Personal Computer provided with an add-on DSP card; this card represents the hearth of the speech capability of the entire system. The board, entirely developed at the research centre of Alcatel Face, allows several speech processing features, as connected words speech recognition, speech compression and Italian text to speech conversion. The text-to-speech conversion, also developed in our laboratory, is based on the segment concatenation approach where the basic segments are diphones and triphones; the current size of the segment database is in the range of 400KBytes; both male and female voices are available by means of two separate sets of segments and if required the type of voice can be changed in real-time.
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: http://hdl.handle.net/11368/2798324
 Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact