MR
Mayur Rathi
@mayurrathi
⭐ 5 GitHub stars

Vector Database Engineer

Expert in vector databases, embedding strategies, and semantic search implementation. Masters Pinecone, Weaviate, Qdrant, Milvus, and pgvector for RAG applications, recommendation systems, and similar

mkdir -p ./skills/vector-database-engineer && curl -sfL https://raw.githubusercontent.com/mayurrathi/awesome-agent-skills/main/skills/vector-database-engineer/SKILL.md -o ./skills/vector-database-engineer/SKILL.md

Run in terminal / PowerShell. Requires curl (Unix) or PowerShell 5+ (Windows).

Skill Content

# Vector Database Engineer


Expert in vector databases, embedding strategies, and semantic search implementation. Masters Pinecone, Weaviate, Qdrant, Milvus, and pgvector for RAG applications, recommendation systems, and similarity search. Use PROACTIVELY for vector search implementation, embedding optimization, or semantic retrieval systems.


Do not use this skill when


- The task is unrelated to vector database engineer

- You need a different domain or tool outside this scope


Instructions


- Clarify goals, constraints, and required inputs.

- Apply relevant best practices and validate outcomes.

- Provide actionable steps and verification.

- If detailed examples are required, open `resources/implementation-playbook.md`.


Capabilities


- Vector database selection and architecture

- Embedding model selection and optimization

- Index configuration (HNSW, IVF, PQ)

- Hybrid search (vector + keyword) implementation

- Chunking strategies for documents

- Metadata filtering and pre/post-filtering

- Performance tuning and scaling


Use this skill when


- Building RAG (Retrieval Augmented Generation) systems

- Implementing semantic search over documents

- Creating recommendation engines

- Building image/audio similarity search

- Optimizing vector search latency and recall

- Scaling vector operations to millions of vectors


Workflow


1. Analyze data characteristics and query patterns

2. Select appropriate embedding model

3. Design chunking and preprocessing pipeline

4. Choose vector database and index type

5. Configure metadata schema for filtering

6. Implement hybrid search if needed

7. Optimize for latency/recall tradeoffs

8. Set up monitoring and reindexing strategies


Best Practices


- Choose embedding dimensions based on use case (384-1536)

- Implement proper chunking with overlap

- Use metadata filtering to reduce search space

- Monitor embedding drift over time

- Plan for index rebuilding

- Cache frequent queries

- Test recall vs latency tradeoffs

🎯 Best For

  • Claude users
  • Software engineers
  • Development teams
  • Tech leads

💡 Use Cases

  • Code quality improvement
  • Best practice enforcement

📖 How to Use This Skill

  1. 1

    Install the Skill

    Copy the install command from the Terminal tab and run it. The SKILL.md file downloads to your local skills directory.

  2. 2

    Load into Your AI Assistant

    Open Claude and reference the skill. Paste the SKILL.md content or use the system prompt tab.

  3. 3

    Apply Vector Database Engineer to Your Work

    Open your project in the AI assistant and ask it to apply the skill. Start with a small module to verify the output quality.

  4. 4

    Review and Refine

    Review AI suggestions before committing. Run tests, check for regressions, and iterate on the skill output.

❓ Frequently Asked Questions

Is Vector Database Engineer compatible with Cursor and VS Code?

Yes — this skill works with any AI coding assistant including Cursor, VS Code with Copilot, and JetBrains IDEs.

Do I need specific dependencies for Vector Database Engineer?

Check the install command and Works With section. Most code skills only require the AI assistant and your codebase.

How do I install Vector Database Engineer?

Copy the install command from the Terminal tab and run it. The skill downloads to ./skills/vector-database-engineer/SKILL.md, ready to use.

Can I customize this skill for my team?

Absolutely. Edit the SKILL.md file to add team-specific instructions, examples, or workflows.

⚠️ Common Mistakes to Avoid

Skipping validation

Always test AI-generated code changes, even for simple refactors.

Missing dependency updates

Check if the skill requires updated dependencies or new packages.

🔗 Related Skills