DC Field | Value | Language |
---|---|---|
dc.contributor.advisor | Kim, Dong Jun | - |
dc.contributor.advisor | 김동준 | - |
dc.contributor.author | Jung, Kihoon | - |
dc.date.accessioned | 2019-09-04T02:46:05Z | - |
dc.date.available | 2019-09-04T02:46:05Z | - |
dc.date.issued | 2018 | - |
dc.identifier.uri | http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=734091&flag=dissertation | en_US |
dc.identifier.uri | http://hdl.handle.net/10203/267014 | - |
dc.description | 학위논문(석사) - 한국과학기술원 : 전산학부, 2018.2,[v, 39 p. :] | - |
dc.description.abstract | Recent rapid advances in Machine Learning (ML) have led to many novel applications previously thought infeasible. The proliferation of various speech recognition solutions in various languages shows ML’s promise as the solution for a natural way to interact with devices. Speaker classification has been a subject with less than stellar results, using expensive multi-sensor, multi-device solutions, as well as sacrificing ease of use to achieve acceptable accuracy. We believe utilizing only a single device will greatly enhance the usability, and therefore the utility, of the technology, enabling novel applications such as preventing child abduction and facilitating at-home speech therapy, by automating previously labor-intensive tasks. This study looks at the feasibility of the idea by proposing and evaluating various methods to classify voice as to whether they are from an adult or a small child. This paper demonstrates that specific combinations of existing technologies can now deliver acceptable accuracy in child-adult voice classification without the addition of expensive equipment or cumbersome user interaction, such as training a custom ML model. We believe it is feasible that further research could further improve the technology, facilitating the realization of aforementioned use-cases, even without the aid of a vast and regularized corpus of data. | - |
dc.language | eng | - |
dc.publisher | 한국과학기술원 | - |
dc.subject | Machine Learning▼aSpeech classification▼aDeep learning▼aClustering▼aSingle source | - |
dc.subject | 기계 학습▼a음성 분류▼a딥 러닝▼a클러스터 분석▼a단일 측정기 | - |
dc.title | Classifying vocalization from parent-child conversations with a single sensor | - |
dc.title.alternative | 단일 측정기를 통한 모자 음성 분류 | - |
dc.type | Thesis(Master) | - |
dc.identifier.CNRN | 325007 | - |
dc.description.department | 한국과학기술원 :전산학부, | - |
dc.contributor.alternativeauthor | 정기훈 | - |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.