Artificial intelligence technology has made significant progress in many fields of medicine. It can integrate a large amount of genetic and molecular data, pharmacological data in the field of new drug research and development, conduct effective target screening and drug design, save drug research and development costs, and shorten drug development time. This article takes Discoidin Domain Receptor 1 (DDR1) as the target and generates potential DDR1 kinase inhibitors, which are used for treatment of fibrosis and other diseases. This article uses a modified generative tensorial reinforcement learning (GENTRL) as the core of the model, uses Self-Referencing Embedded Strings (SELFIES) instead of the most commonly used Simplified Molecular Input Line Entry System (SMILES) as the molecular encoding method, and uses a feedback mechanism and optimize model performance and generate high-quality potential DDR1 kinase inhibitors. We show that the modified GENTRL model can effectively generate highly effective molecules, and SELFIES can improve the valid percentage of generated molecules compared to SMILES and allow the model to produce highly valid molecules in a shorter time. In addition, the feedback mechanism can effectively increase the docking score of the generated molecules with target protein, generating compounds with better binding affinity to the DDR1 protein site.