To prepare for the anticipated age of human-robot symbiosis, robots should be able to interact and cooperate with humans effectively by understanding the meaning and intention of human behavior. In this paper, we define human intention as "desired behavior of the human using objects." To infer the defined human intention, a robot should learn the object affordance along with a behavior hierarchy structure. Thus, in this paper, we propose a behavior hierarchy-based affordance network (BHAN) and a behavior hierarchy-based affordance map (BHAM) to represent the object affordance, behavior hierarchy structure, and object hierarchy structure, simultaneously. Autonomous and interactive BHAN/BHAM learning algorithms are also proposed to make a robot develop the BHAN and BHAM by itself, as well as by interacting with a human. Based on the newly developed BHANs and BHAM, a robot could infer the human intention from information observed in context and from human behavior. The effectiveness of the proposed method was demonstrated through experiments on human-robot interaction with building blocks using a simulated differential wheel robot and a real human-sized humanoid robot