DSpace at KOASAS: Budget distribution for efficient data labeling over crowdsourcing system

DSpace at KOASAS

College of Engineering(공과대학)School of Electrical Engineering(전기및전자공학부)EE-Theses_Master(석사논문)

Budget distribution for efficient data labeling over crowdsourcing system크라우드소싱 시스템 상에서의 효율적인 데이터 라벨링을 위한 예산 배분

Cited 0 time in webofscience

Cited 0 time in scopus

Hit : 124
Download : 0

Export

DC Field	Value	Language
dc.contributor.advisor	Chung, Hye Won	-
dc.contributor.advisor	정혜원	-
dc.contributor.author	Cho, Seyoung	-
dc.date.accessioned	2021-05-11T19:33:33Z	-
dc.date.available	2021-05-11T19:33:33Z	-
dc.date.issued	2019	-
dc.identifier.uri	http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=875346&flag=dissertation	en_US
dc.identifier.uri	http://hdl.handle.net/10203/283052	-
dc.description	학위논문(석사) - 한국과학기술원 : 전기및전자공학부, 2019.8,[iii, 27 p. :]	-
dc.description.abstract	We consider query-based data labeling problem in which, the goal is to classify k objects in database into binary attributes. Queries are designed using the following rule. First, randomly select query difficulty(d) number of objects. Next, ask whether those objects have an even or odd number for the given attribute. Designed queries are distributed to workers using a crowdsourcing system. We consider two system models in this paper. First is crowdsourcing erasure model. In the erasure model, workers either provides the correct answer for a query if he/she knows the answer or, refuses to answer if he/she is unsure about the answer. The second is a crowdsourcing error model. In the error model, a worker always supplies an answer, but the answer can be right or wrong. In this paper, we consider the case of multiple worker groups. Workers in the same group have the same performance on queries and the same cost for raising a query. However, workers in different groups show a different performance on queries and a different cost for raising a query. In this situation, depending on how we allocate queries to each group, the total cost used to label objects may vary. In this paper, our goal is to find the optimal distribution of queries for each group to minimize the total cost of classifying object attributes.	-
dc.language	eng	-
dc.publisher	한국과학기술원	-
dc.subject	Crowdsourcing▼aquery difficulty▼agroup▼acost▼aerasure model▼aerror model	-
dc.subject	크라우드소싱▼a질문 복잡도▼a집단▼a비용▼a소거 모델▼a오류 모델	-
dc.title	Budget distribution for efficient data labeling over crowdsourcing system	-
dc.title.alternative	크라우드소싱 시스템 상에서의 효율적인 데이터 라벨링을 위한 예산 배분	-
dc.type	Thesis(Master)	-
dc.identifier.CNRN	325007	-
dc.description.department	한국과학기술원 :전기및전자공학부,	-
dc.contributor.alternativeauthor	조세영	-

Appears in Collection: EE-Theses_Master(석사논문)

Files in This Item: There are no files associated with this item.

Display Simple Item Record

qr_code

트윗하기

KOASAS

Knowledge Service Development Team, KAIST 291 Daehak-ro, Yuseong-gu, Daejeon 34141, Republic of Korea. T. 82-42-350-4493 Email. koasas@kaist.ac.kr
Copyright © 2016. Korea Advanced Institute of Science and Technology. All Rights Reserved.

KOASAS

KOASAS

Browse

Budget distribution for efficient data labeling over crowdsourcing system크라우드소싱 시스템 상에서의 효율적인 데이터 라벨링을 위한 예산 배분

KOASAS

Communities & Collections