mirror of
https://github.com/NVIDIA/dgx-spark-playbooks.git
synced 2026-04-25 19:33:53 +00:00
- Integrate NVIDIA API as alternative to Ollama for graph queries - Implement thinking tokens API with /think system message - Add min_thinking_tokens (1024) and max_thinking_tokens (2048) - Format reasoning_content with <think> tags for UI parsing - Support dynamic model/provider selection per query - Maintain Ollama fallback for backward compatibility This enables Traditional Graph to use NVIDIA's reasoning models (e.g., nvidia-nemotron-nano-9b-v2) with visible chain-of-thought. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com> |
||
|---|---|---|
| .. | ||
| arangodb-loader.ts | ||
| arangodb.ts | ||
| backend-service.ts | ||
| client-init.ts | ||
| embeddings.ts | ||
| graph-db-service.ts | ||
| graph-db-util.ts | ||
| langchain-service.ts | ||
| llm-service.ts | ||
| neo4j.ts | ||
| pinecone.ts | ||
| qdrant.ts | ||
| query-logger.ts | ||
| rag.ts | ||
| remote-backend.ts | ||
| text-processor.ts | ||
| utils.ts | ||