Open-vocabulary object detection (OVD) aims to detect arbitrary objects specified by user-provided text labels. Although recent progress has enhanced zero-shot detection…
Large language models (LLMs) have gained widespread adoption due to their advanced text understanding and generation capabilities. However, ensuring their…