Budget distribution for efficient data labeling over crowdsourcing system크라우드소싱 시스템 상에서의 효율적인 데이터 라벨링을 위한 예산 배분

Cited 0 time in webofscience Cited 0 time in scopus
  • Hit : 124
  • Download : 0
DC FieldValueLanguage
dc.contributor.advisorChung, Hye Won-
dc.contributor.advisor정혜원-
dc.contributor.authorCho, Seyoung-
dc.date.accessioned2021-05-11T19:33:33Z-
dc.date.available2021-05-11T19:33:33Z-
dc.date.issued2019-
dc.identifier.urihttp://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=875346&flag=dissertationen_US
dc.identifier.urihttp://hdl.handle.net/10203/283052-
dc.description학위논문(석사) - 한국과학기술원 : 전기및전자공학부, 2019.8,[iii, 27 p. :]-
dc.description.abstractWe consider query-based data labeling problem in which, the goal is to classify k objects in database into binary attributes. Queries are designed using the following rule. First, randomly select query difficulty(d) number of objects. Next, ask whether those objects have an even or odd number for the given attribute. Designed queries are distributed to workers using a crowdsourcing system. We consider two system models in this paper. First is crowdsourcing erasure model. In the erasure model, workers either provides the correct answer for a query if he/she knows the answer or, refuses to answer if he/she is unsure about the answer. The second is a crowdsourcing error model. In the error model, a worker always supplies an answer, but the answer can be right or wrong. In this paper, we consider the case of multiple worker groups. Workers in the same group have the same performance on queries and the same cost for raising a query. However, workers in different groups show a different performance on queries and a different cost for raising a query. In this situation, depending on how we allocate queries to each group, the total cost used to label objects may vary. In this paper, our goal is to find the optimal distribution of queries for each group to minimize the total cost of classifying object attributes.-
dc.languageeng-
dc.publisher한국과학기술원-
dc.subjectCrowdsourcing▼aquery difficulty▼agroup▼acost▼aerasure model▼aerror model-
dc.subject크라우드소싱▼a질문 복잡도▼a집단▼a비용▼a소거 모델▼a오류 모델-
dc.titleBudget distribution for efficient data labeling over crowdsourcing system-
dc.title.alternative크라우드소싱 시스템 상에서의 효율적인 데이터 라벨링을 위한 예산 배분-
dc.typeThesis(Master)-
dc.identifier.CNRN325007-
dc.description.department한국과학기술원 :전기및전자공학부,-
dc.contributor.alternativeauthor조세영-
Appears in Collection
EE-Theses_Master(석사논문)
Files in This Item
There are no files associated with this item.

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0