Mirror of https://github.com/NVIDIA/dgx-spark-playbooks.git (synced 2026-04-24 19:03:54 +00:00)
Latest commit:

- Implement parallel chunk processing with configurable concurrency
- Add direct NVIDIA API integration bypassing LangChain for better control
- Optimize for DGX Spark unified memory with batch processing
- Use concurrency of 4 for Ollama, 2 for other providers
- Add proper error handling and user stop capability
- Update NVIDIA model to Llama 3.3 Nemotron Super 49B v1.5
- Improve prompt engineering for triple extraction

| Name | | |
|---|---|---|
| arangodb-loader.ts | ||
| arangodb.ts | ||
| backend-service.ts | ||
| client-init.ts | ||
| embeddings.ts | ||
| graph-db-service.ts | ||
| graph-db-util.ts | ||
| langchain-service.ts | ||
| llm-service.ts | ||
| neo4j.ts | ||
| pinecone.ts | ||
| query-logger.ts | ||
| rag.ts | ||
| remote-backend.ts | ||
| text-processor.ts | ||
| utils.ts | ||
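
The commit message above mentions parallel chunk processing with configurable concurrency (4 for Ollama, 2 for other providers), error handling, and a user stop capability. The repository's actual implementation is not shown on this page, so the following is only a minimal sketch of how such a bounded-concurrency loop could look. All names here (`Provider`, `concurrencyFor`, `processChunks`, `shouldStop`) are hypothetical and are not taken from the listed files.

```typescript
// Hypothetical sketch: none of these names come from the repository.
type Provider = "ollama" | "nvidia" | "other";

// Concurrency values from the commit message: 4 for Ollama, 2 for other providers.
function concurrencyFor(provider: Provider): number {
  return provider === "ollama" ? 4 : 2;
}

interface ChunkResult<T> {
  index: number;
  value?: T;      // set when the worker succeeded
  error?: string; // set when the worker threw
}

// Runs `worker` over all chunks with at most `concurrency` calls in flight.
// A failing chunk is recorded and does not abort the others.
// `shouldStop` is polled before each new chunk starts (the "user stop" hook);
// chunks already in flight are allowed to finish.
async function processChunks<T>(
  chunks: string[],
  worker: (chunk: string, index: number) => Promise<T>,
  concurrency: number,
  shouldStop: () => boolean = () => false,
): Promise<ChunkResult<T>[]> {
  const results: ChunkResult<T>[] = [];
  let next = 0; // index of the next chunk to hand out

  async function runLane(): Promise<void> {
    while (next < chunks.length && !shouldStop()) {
      const index = next++;
      try {
        const value = await worker(chunks[index], index);
        results.push({ index, value });
      } catch (err) {
        const message = err instanceof Error ? err.message : String(err);
        results.push({ index, error: message });
      }
    }
  }

  // Start a fixed number of lanes; each lane pulls chunks until none remain.
  const laneCount = Math.max(1, Math.min(concurrency, chunks.length));
  await Promise.all(Array.from({ length: laneCount }, () => runLane()));

  // Lanes finish out of order, so restore the original chunk order.
  return results.sort((a, b) => a.index - b.index);
}
```

A caller might pass a function that sends one chunk to the model and returns the extracted triples as `worker`, and `concurrencyFor(provider)` as `concurrency`. Using a fixed set of lanes rather than launching every request at once keeps the number of simultaneous model calls bounded, which is the stated goal of the per-provider limits.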