Abstract
Having received a signal, unlike traditional speech processing, the aim of this research goal is not to identify where individual word boundaries begin and end or detect the pattern set, using supervised techniques, which comprise the signal s lexicon. The rationale that underpins this approach is therefore, not to decipher the audio signal content as this is a secondary task and assumes language content exists, but to identify what constitutes the physical structure of spoken language, in contrast to other structured phenomena. In essence, to develop an automated (artificially intelligent) intuitive ear that can detect the rhythm and structure of language with the same accuracy (or better) of the human ear. To achieve this, unsupervised learning techniques, which do not rely on prior knowledge of a specific system, underpin generic methods devised, to facilitate classification of unknown phenomena, if encountered.
Original language | English |
---|---|
Title of host publication | Proceedings of the International conference on Computer, Communication and Control Technologies (CCCT ’04) |
Editors | Hsing-Wei Chu |
Place of Publication | Austin, TX |
Publisher | International Institute of Informatics and Systemics (IIIS) |
Number of pages | 6 |
Publication status | Published - 14 Aug 2004 |
Event | International Conference on Computing, Communications and Control Technologies (CCCT'04) - University of Texas, Austin, TX, United States Duration: 14 Aug 2004 → 17 Aug 2004 https://www.iiis.org/CDs/CD2004CCCT/index.asp |
Conference
Conference | International Conference on Computing, Communications and Control Technologies (CCCT'04) |
---|---|
Country/Territory | United States |
City | Austin, TX |
Period | 14/08/04 → 17/08/04 |
Internet address |
Keywords
- Audio
- Language
- Unsupervised learning
- Significant Activity Segments (SAS)
- Detection