DC Field | Value | Language |
---|---|---|
dc.contributor.advisor | Chung, Hye Won | - |
dc.contributor.advisor | 정혜원 | - |
dc.contributor.author | Cho, Seyoung | - |
dc.date.accessioned | 2021-05-11T19:33:33Z | - |
dc.date.available | 2021-05-11T19:33:33Z | - |
dc.date.issued | 2019 | - |
dc.identifier.uri | http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=875346&flag=dissertation | en_US |
dc.identifier.uri | http://hdl.handle.net/10203/283052 | - |
dc.description | Thesis (Master's) - Korea Advanced Institute of Science and Technology (KAIST) : School of Electrical Engineering, 2019.8, [iii, 27 p.] | - |
dc.description.abstract | We consider a query-based data labeling problem in which the goal is to classify k objects in a database according to binary attributes. Queries are designed by the following rule. First, d objects are selected at random, where d is the query difficulty. Next, the query asks whether an even or odd number of those objects have the given attribute. The designed queries are distributed to workers through a crowdsourcing system. We consider two system models. The first is a crowdsourcing erasure model, in which a worker either provides the correct answer to a query if he or she knows it, or refuses to answer if unsure. The second is a crowdsourcing error model, in which a worker always supplies an answer, but the answer may be right or wrong. We consider the case of multiple worker groups. Workers in the same group have the same performance on queries and the same cost per query, whereas workers in different groups differ in both performance and cost. In this setting, the total cost of labeling the objects depends on how queries are allocated to each group. Our goal is to find the allocation of queries across groups that minimizes the total cost of classifying the object attributes. | - |
dc.language | eng | - |
dc.publisher | Korea Advanced Institute of Science and Technology (KAIST) | - |
dc.subject | Crowdsourcing; query difficulty; group; cost; erasure model; error model | - |
dc.subject | 크라우드소싱; 질문 복잡도; 집단; 비용; 소거 모델; 오류 모델 | - |
dc.title | Budget distribution for efficient data labeling over crowdsourcing system | - |
dc.title.alternative | 크라우드소싱 시스템 상에서의 효율적인 데이터 라벨링을 위한 예산 배분 | - |
dc.type | Thesis(Master) | - |
dc.identifier.CNRN | 325007 | - |
dc.description.department | Korea Advanced Institute of Science and Technology (KAIST) : School of Electrical Engineering | - |
dc.contributor.alternativeauthor | 조세영 | - |
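
The query rule and the two worker models described in the abstract can be illustrated with a minimal Python sketch. It is not taken from the thesis: the function names and the fixed probabilities `p_erase` and `p_error` are hypothetical, and the abstract does not state how a group's performance depends on the query difficulty d.

```python
import random


def make_parity_query(num_objects, d, rng):
    """Select d distinct object indices uniformly at random (d = query difficulty)."""
    return rng.sample(range(num_objects), d)


def true_answer(labels, query):
    """Return 1 if an odd number of the queried objects have the attribute, else 0."""
    return sum(labels[i] for i in query) % 2


def erasure_worker(labels, query, p_erase, rng):
    """Erasure model: refuse (None) with probability p_erase, otherwise answer correctly.

    p_erase is an assumed, fixed refusal probability for illustration only.
    """
    if rng.random() < p_erase:
        return None
    return true_answer(labels, query)


def error_worker(labels, query, p_error, rng):
    """Error model: always answer, but flip the answer with probability p_error.

    p_error is an assumed, fixed error probability for illustration only.
    """
    answer = true_answer(labels, query)
    if rng.random() < p_error:
        return 1 - answer
    return answer
```

For example, with `labels = [1, 0, 1, 1, 0]` and the query `[0, 2, 3]`, three of the queried objects have the attribute, so the correct parity answer is 1 (odd).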