Skip to main content
2 votes
1 answer
130 views

I have embeddings (about 160 Million) that I created with a BERT-based encoder model. Right now they are in a .pt format and takes about 500GB in the disk. I want 2 things: To save them in an ...
Noam's user avatar
  • 55
0 votes
0 answers
73 views

I'm trying to compute a measure of semantic similarity between titles of scientific publications using SPECTER2, but the model performs poorly. Here is my code: from transformers import AutoTokenizer ...
robertspierre's user avatar
1 vote
1 answer
207 views

My code: from transformers import AutoTokenizer, AutoModel model_name = "NVIDIA/nv-embed-v2" tokenizer = AutoTokenizer.from_pretrained(model_name) model = AutoModel.from_pretrained(...
6zL's user avatar
  • 21
0 votes
1 answer
288 views

I'm working with a vector database (qdrant) and using semantic search via embeddings (Sentence Transformers). My use case involves queries like: "Give animals other than cats" or "List ...
Vummenthala Sai ramana reddy's user avatar
0 votes
1 answer
125 views

This part of the code, specifically the part where rag_chain is invoked causes an error: Retrying langchain_cohere.embeddings.CohereEmbeddings.embed_with_retry.<locals>._embed_with_retry in 4.0 ...
Akshitha Rao's user avatar
0 votes
1 answer
848 views

I have been having problems integrating vertex ai matching engine for embedding similarity search using the google cloud aiplatform python sdk, specifically when adding metadata to an index datapoint ...
Precious112's user avatar
1 vote
1 answer
187 views

Is it generally possible to create image embeddings directly (without additional text) and store them in a database? The aim is to make the content of the images findable later via a text input in the ...
Felix's user avatar
  • 201
0 votes
0 answers
45 views

I am a beginner in text mining analysis and am currently learning Dynamic Word Embedding (DWB) techniques. While running the replication codes from this Kaggle notebook, I encountered the following ...
ryang6476's user avatar
1 vote
1 answer
174 views

I have two sets of embeddings from two different GNNs. I want to compare the embeddings by visualization and I want to know which way is the most appropriate way for comparison. Do I fit t-SNE ...
Darkmoon Chief's user avatar
0 votes
1 answer
665 views

I need to compute embeddings for a large number of sentences (say 10K) in preprocessing, and at runtime I will have to compute the embedding vector for one sentence at a time (user query), and then ...
Alaa M.'s user avatar
  • 5,382
0 votes
1 answer
469 views

I tried a lot, but I want to read embeddings from the jina embeddings this is my java code: public static float[] getTextEmbedding(String text) throws ModelNotFoundException, MalformedModelException, ...
Richard Burkhardt's user avatar
-1 votes
1 answer
114 views

I have a large set CSVs with numeric and text data; here's a sample: Company ID Company Name Group ID Currency Amount ... 8494494 Acme Inc F942G EUR $1.56 ... 9283422A Walmart XXH3F3 AUD $5.64 ... ... ...
olives's user avatar
  • 118
0 votes
1 answer
549 views

I found in Is it possible to freeze only certain embedding weights in the embedding layer in pytorch? a nice way to freeze only some indices of an embedding layer. However, while including it in a ...
Mirco Ramo's user avatar
1 vote
0 answers
66 views

I am solving an image captioning related issue and eventually I have extract the embeddings of the tokens. One possible way is to extract the embeddings using the tokens. But I cannot do do that ...
user491683's user avatar
0 votes
1 answer
226 views

I'm building a SimpleRNN model with an Embedding layer in Keras and encountering an issue when using the Sequential API. The model summary shows the output shape as ? and the number of trainable ...
Arjun Kumar's user avatar

15 30 50 per page
1
2 3 4 5
74