dgx-spark-playbooks

mirror of https://github.com/NVIDIA/dgx-spark-playbooks.git synced 2026-04-26 03:43:52 +00:00

Author	SHA1	Message	Date
Santosh Bhavani	69cd444ea7	Add NVIDIA API support with thinking tokens for Traditional Graph - Integrate NVIDIA API as alternative to Ollama for graph queries - Implement thinking tokens API with /think system message - Add min_thinking_tokens (1024) and max_thinking_tokens (2048) - Format reasoning_content with <think> tags for UI parsing - Support dynamic model/provider selection per query - Maintain Ollama fallback for backward compatibility This enables Traditional Graph to use NVIDIA's reasoning models (e.g., nvidia-nemotron-nano-9b-v2) with visible chain-of-thought. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-10-25 13:48:44 -07:00
Santosh Bhavani	de9c46e97e	Replace Pinecone with Qdrant for ARM64 compatibility - Migrate from Pinecone to Qdrant vector database for native ARM64 support - Add Qdrant service with automatic collection initialization in docker-compose - Implement QdrantService with UUID-based point IDs to meet Qdrant requirements - Update all API routes and frontend components to use Qdrant - Enhance Storage Connections UI with detailed stats (vectors, status, dimensions) - Add icons and tooltips to Vector DB section matching Graph DB UX	2025-10-24 23:16:44 -07:00
Santosh Bhavani	a082a8a737	feat(backend): implement LLM-enhanced query method - Add queryWithLLM method to BackendService - Retrieves top K triples from graph and uses LLM to generate answers - Supports configurable LLM model and provider selection - Uses research-backed prompt structure for KG-enhanced RAG - Includes fallback handling for LLM errors	2025-10-19 19:57:12 -07:00
GitLab CI	27fe116e71	chore: Regenerate all playbooks	2025-10-06 17:05:41 +00:00

4 Commits