An image processing apparatus obtains a first output image by applying an image to a first training network model, obtains a second output image by applying the image to a second training network model, and obtains a reconstructed image based on the first output image and the second output image. The first training network model is a model that uses a fixed parameter obtained through training on a plurality of sample images, and the second training network model is trained to minimize a difference between a target image corresponding to the image and the reconstructed image.
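
The arrangement described above can be illustrated with a minimal sketch in plain Python. Everything concrete here is an assumption made for illustration and is not taken from the source: the image is a flat list of pixel values, the first model is a single frozen gain (`FIXED_GAIN`), the second model is a trainable scale and offset (`w`, `b`), the two outputs are combined by element-wise addition, and the difference is measured as mean squared error minimized by gradient descent. The names `first_model`, `second_model`, `reconstruct`, and `train_second_model` are hypothetical.

```python
# Illustrative sketch only: the model forms, the additive combination,
# and the MSE loss are assumptions, not details given in the source.

FIXED_GAIN = 0.8  # frozen parameter, assumed to come from prior training on sample images


def first_model(image):
    """First network: uses only the fixed parameter; never updated."""
    return [FIXED_GAIN * p for p in image]


def second_model(image, w, b):
    """Second network: a trainable scale w and offset b (hypothetical form)."""
    return [w * p + b for p in image]


def reconstruct(out1, out2):
    """Combine both outputs; element-wise addition is an assumed choice."""
    return [a + c for a, c in zip(out1, out2)]


def mse(a, b):
    """Mean squared difference between two images of equal length."""
    return sum((x - y) ** 2 for x, y in zip(a, b)) / len(a)


def train_second_model(image, target, steps=500, lr=0.1):
    """Gradient descent on w and b only; the first model stays fixed."""
    w, b = 0.0, 0.0
    out1 = first_model(image)  # fixed output, so it is computed once
    n = len(image)
    for _ in range(steps):
        recon = reconstruct(out1, second_model(image, w, b))
        err = [r - t for r, t in zip(recon, target)]
        # Gradients of the MSE between reconstructed and target image
        grad_w = sum(2 * e * p for e, p in zip(err, image)) / n
        grad_b = sum(2 * e for e in err) / n
        w -= lr * grad_w
        b -= lr * grad_b
    return w, b


image = [0.1, 0.4, 0.6, 0.9]
target = [0.15, 0.45, 0.65, 0.95]  # made-up target image for the demo

initial_loss = mse(reconstruct(first_model(image), second_model(image, 0.0, 0.0)), target)
w, b = train_second_model(image, target)
final_loss = mse(reconstruct(first_model(image), second_model(image, w, b)), target)
print(f"loss before training: {initial_loss:.6f}, after: {final_loss:.6f}")
```

In this sketch only `w` and `b` change during training, so the second model learns whatever correction the fixed first model leaves behind. A real apparatus would presumably use deeper networks and might combine the outputs differently.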