The present invention provides an apparatus and a method for constructing a library for deriving a material composition using empirical result. Which enables acceleration of research on the material-properties relationship. By applying the empirical results of the material composition, missing data of the material compositions can be statistically calculated by using supervised non-linear imputation techniques. The completed composition information of the materials is passed as an input of machine learning material-properties relationship prediction.