Abstract: In this paper, we propose methods to build a powerful and efficient Image-to-Speech captioning (Im2Sp) model. To this end, we start with importing the rich knowledge related to image ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results