Page 398 - AI for Good Innovate for Impact

P. 398

AI for Good Innovate for Impact

• Convert the input name to a dense embedding
• Precompute embeddings for existing names in the database and store them in:

• Elasticsearch vector index (with dense_vector)

5) Semantic Search
• Use cosine similarity between input name and stored name embeddings
• Retrieve top N semantically similar names

6) Filter Results

• Remove any names that caused rejection (e.g., exact match or prohibited)
• Filter based on policy: sensitive words, reserved names, etc.
• Prioritize names that:

• Are semantically similar but legally and contextually acceptable
• Match user’s language preference

7) Scoring & Ranking
• Combine scores:

• Rule-based score (linguistic similarity, uniqueness, policy compliance)
• SBERT score (semantic similarity)
• Weighted average or customizable scoring strategy

8) Response Generation: Return a structured response

362

393 394 395 396 397 398 399 400 401 402 403