order-embedding
Vector mapping
This project maps images and their captions into a common vector space using an asymmetric partial order relation.
Implementation of caption-image retrieval from the paper "Order-Embeddings of Images and Language"
186 stars
15 watching
63 forks
Language: JavaScript
last commit: over 8 years ago Related projects:
Repository | Description | Stars |
---|---|---|
| Lets you embed images into markup languages like LaTeX, Markdown or Typst with ease | 542 |
| Learning word embeddings from abstract images to improve language understanding | 19 |
| An implementation of a deep learning-based image representation learning approach using a modified fully connected layer and transfer learning from VGG16 | 34 |
| A PyTorch implementation of visual-semantic embedding methods for image-caption retrieval | 492 |
| A fast and efficient utility package for utilizing vector embeddings in machine learning models | 1,635 |
| A Python implementation of an image-sentence embedding method using LSTM networks. | 427 |
| A repository providing code and tools for building image-text embedding models using convolutional neural networks and word embeddings. | 287 |
| An open-source implementation of the Mixture-of-Embeddings-Experts model in Pytorch for video-text retrieval tasks. | 118 |
| A wrapper around Tesseract-OCR to simplify OCR functionality in V programming language | 17 |
| Develops a framework for multi-view 3D object detection and perception from camera images using position embedding transformation. | 881 |
| An implementation of a neural network model for learning efficient sentence representations from text data. | 205 |
| This project provides a tool for embedding metadata into images, allowing users to add copyright and licensing information to digital content. | 1 |
| A Ruby wrapper around a vector search database API for efficient similarity searches in high-dimensional space | 25 |
| A curated list of resources and papers on 3D and 2D delineation techniques for image processing and computer vision applications. | 22 |
| This is a tool for creating deep sentence embeddings using Sequence-to-Sequence learning. | 22 |