上一条: Action-Aware Embedding Enhancement for Image-Text Retrieval
下一条: Mixed Supervised Object Detection by Transferring Mask Prior and Semantic Similarity