In this paper, we propose a vision-based all-in-one solution for augmented reality where users want to exploit unknown 3D objects in their systems. In our solution, we facilitate two time-consuming off-line processes: obtaining information, such as keyframes and keypoints, for real-time tracking of unknown 3D targets, and estimating local coordinates with a scale for accurate registration of virtual content. The proposed solution only requires images with minimal interactions. The users do not need to know about 3D markerless tracking in depth. At the end, we propose a framework for AR miniatures systems to verify the effectiveness of our solution. As a result, all developed systems worked in real-time, more than 25 fps, and showed reliable registration even in severe viewpoint changes. Our demonstration videos are available in the supplemental materials.