Commit Graph

193 Commits

Author SHA1 Message Date
Santosh Bhavani
de9c46e97e Replace Pinecone with Qdrant for ARM64 compatibility
- Migrate from Pinecone to Qdrant vector database for native ARM64 support
- Add Qdrant service with automatic collection initialization in docker-compose
- Implement QdrantService with UUID-based point IDs to meet Qdrant requirements
- Update all API routes and frontend components to use Qdrant
- Enhance Storage Connections UI with detailed stats (vectors, status, dimensions)
- Add icons and tooltips to Vector DB section matching Graph DB UX
2025-10-24 23:16:44 -07:00
Santosh Bhavani
cfebbc7b04 Add stop.sh script 2025-10-24 22:03:47 -07:00
Santosh Bhavani
eec479197b Add Docker permission validation 2025-10-24 22:02:23 -07:00
Santosh Bhavani
07d4107da4 Merge remote-tracking branch 'upstream/main' 2025-10-24 19:51:27 -07:00
Santosh Bhavani
6e90701a9b Add document tracking to prevent duplicates 2025-10-24 19:45:41 -07:00
Santosh Bhavani
97e4be5772 Add configurable NVIDIA model support 2025-10-24 19:45:36 -07:00
Santosh Bhavani
215ce25c05 Update NVIDIA models to Nemotron Super/Nano 2025-10-24 19:45:31 -07:00
GitLab CI
6a34e25169 chore: Regenerate all playbooks 2025-10-22 19:44:23 +00:00
GitLab CI
ab0cb00e0b chore: Regenerate all playbooks 2025-10-22 18:54:29 +00:00
GitLab CI
d301ca4f84 chore: Regenerate all playbooks 2025-10-22 16:17:25 +00:00
GitLab CI
15beb4e9fc chore: Regenerate all playbooks 2025-10-21 13:09:58 +00:00
GitLab CI
c66572a74b chore: Regenerate all playbooks 2025-10-21 03:53:26 +00:00
GitLab CI
8ca84d63e9 chore: Regenerate all playbooks 2025-10-21 03:50:02 +00:00
GitLab CI
3c3578c620 chore: Regenerate all playbooks 2025-10-21 03:40:46 +00:00
GitLab CI
11f2a77ea7 chore: Regenerate all playbooks 2025-10-21 00:57:26 +00:00
Santosh Bhavani
23b5cbca4c feat(processor): add parallel processing and NVIDIA API support
- Implement parallel chunk processing with configurable concurrency
- Add direct NVIDIA API integration bypassing LangChain for better control
- Optimize for DGX Spark unified memory with batch processing
- Use concurrency of 4 for Ollama, 2 for other providers
- Add proper error handling and user stop capability
- Update NVIDIA model to Llama 3.3 Nemotron Super 49B v1.5
- Improve prompt engineering for triple extraction
2025-10-19 20:58:59 -07:00
Santosh Bhavani
12c4777eae feat(langchain): upgrade to Llama 3.3 Nemotron Super 49B
- Update LangChain service to use Llama 3.3 Nemotron Super 49B v1.5
- Adjust temperature to 0.6 for better response quality
- Increase timeout to 120s for larger model
- Add top_p, frequency_penalty, and presence_penalty parameters
- Remove deprecated response_format configuration
2025-10-19 20:57:03 -07:00
Santosh Bhavani
5be2ad78bf feat(ui): upgrade to NVIDIA Llama 3.3 Nemotron Super 49B
- Update default NVIDIA model to Llama 3.3 Nemotron Super 49B v1.5
- Update model display name and description
- Replace deprecated 70B model with newer 49B Super model
2025-10-19 20:57:00 -07:00
Santosh Bhavani
529debb633 perf(docker): increase Ollama parallel processing for DGX
- Increase OLLAMA_NUM_PARALLEL from 1 to 4 requests
- Leverage DGX Spark's unified memory architecture
- Improve throughput for concurrent inference requests
2025-10-19 20:56:58 -07:00
Santosh Bhavani
ffb0688a63 refactor(ui): improve database connection button UI
- Replace button text with icons for compact display
- Add tooltips to Refresh, Disconnect, and Clear buttons
- Improve button spacing and alignment
- Import LogOut icon for disconnect action
2025-10-19 19:57:17 -07:00
Santosh Bhavani
37ee4b63f1 feat(ui): display LLM-generated answers in RAG page
- Switch traditional graph search to use LLM-enhanced endpoint
- Display LLM-generated answer prominently above triples
- Add llmAnswer state to store and display generated answers
- Update results section to show 'Supporting Triples' when answer exists
- Pass selected LLM model and provider to API
- Improve debug logging for query modes and results
2025-10-19 19:57:15 -07:00
Santosh Bhavani
1bb48b9818 feat(component): add LLM selector to RAG query interface
- Integrate LLMSelectorCompact into RAG query component
- Make query mode cards more compact to accommodate LLM selector
- Update styling for better space utilization
- Add LLM selection section with descriptive label
2025-10-19 19:57:15 -07:00
Santosh Bhavani
db1e7760f6 feat(component): add compact LLM selector component
- Create LLMSelectorCompact component for model selection
- Support Ollama and NVIDIA models
- Load available models from localStorage
- Persist selected model and dispatch selection events
- Compact design suitable for inline placement
2025-10-19 19:57:14 -07:00
Santosh Bhavani
156bfb2e8d feat(api): update metrics route for multi-database support
- Update metrics endpoint to use getGraphDbService utility
- Support both ArangoDB and Neo4j database types
- Initialize graph database based on selected type
- Retrieve graph stats from the active database
2025-10-19 19:57:13 -07:00
Santosh Bhavani
a082a8a737 feat(backend): implement LLM-enhanced query method
- Add queryWithLLM method to BackendService
- Retrieves top K triples from graph and uses LLM to generate answers
- Supports configurable LLM model and provider selection
- Uses research-backed prompt structure for KG-enhanced RAG
- Includes fallback handling for LLM errors
2025-10-19 19:57:12 -07:00
Santosh Bhavani
d842dc996a feat(api): add LLM-enhanced graph query endpoint
- Create new /api/graph-query-llm endpoint for graph search + LLM generation
- Retrieves triples using graph search and generates answers using LLM
- Supports both traditional and vector-based graph search
- Makes traditional graph search comparable to RAG for benchmarking
2025-10-19 19:57:11 -07:00
Santosh Bhavani
8c1d2ae9f3 feat(docker): add vector search services and GPU configuration
- Add optional Pinecone and sentence-transformers services for vector search
- Configure NVIDIA GPU support with proper environment variables
- Add new environment variables for embeddings and Pinecone
- Add docker compose profiles to optionally enable vector-search
- Improve CUDA configuration for Ollama service
- Add pinecone-net network for service communication
2025-10-19 19:56:55 -07:00
Santosh Bhavani
9dc734eee5 Add NVIDIA_API_KEY support and update ollama to v0.12.6 2025-10-19 14:52:24 -05:00
GitLab CI
752eada0cb chore: Regenerate all playbooks 2025-10-18 21:48:15 +00:00
GitLab CI
505cacdbd6 chore: Regenerate all playbooks 2025-10-18 21:28:42 +00:00
GitLab CI
a6f94052b1 chore: Regenerate all playbooks 2025-10-17 17:29:40 +00:00
GitLab CI
3ed5b3b073 chore: Regenerate all playbooks 2025-10-17 00:58:35 +00:00
GitLab CI
0d9108cf14 chore: Regenerate all playbooks 2025-10-16 21:25:27 +00:00
GitLab CI
7457f31016 chore: Regenerate all playbooks 2025-10-16 21:14:27 +00:00
GitLab CI
058b5b70b2 chore: Regenerate all playbooks 2025-10-16 20:25:06 +00:00
GitLab CI
c8ab690414 chore: Regenerate all playbooks 2025-10-16 19:02:54 +00:00
GitLab CI
b4a071c721 chore: Regenerate all playbooks 2025-10-16 18:49:21 +00:00
GitLab CI
5cd142bc41 chore: Regenerate all playbooks 2025-10-16 18:35:50 +00:00
GitLab CI
2ff64d7265 chore: Regenerate all playbooks 2025-10-16 17:29:56 +00:00
GitLab CI
6dd7697210 chore: Regenerate all playbooks 2025-10-16 14:13:04 +00:00
GitLab CI
99c6530528 chore: Regenerate all playbooks 2025-10-16 13:05:16 +00:00
GitLab CI
2371189ab9 chore: Regenerate all playbooks 2025-10-15 13:32:10 +00:00
GitLab CI
8a12782b17 chore: Regenerate all playbooks 2025-10-14 14:11:25 +00:00
KJ
fc2e7847da
Remove third-party license reference from README
Removed reference to third-party licensing information.
2025-10-14 09:05:25 -04:00
KJ
daeb58d0da
Delete LICENSE-3rd-party 2025-10-14 09:05:07 -04:00
KJ
b50f58cabd
Update LICENSE-3rd-party with new copyright details
Added copyright information for multiple third-party libraries including Marimo, JAX, Unsloth, and OpenCV.
2025-10-13 22:29:59 -04:00
KJ
ecd88a2ac3
Update documentation links in README.md 2025-10-13 22:20:34 -04:00
KJ
f453234a54
Add Apache License for third-party libraries
Added Apache License 2.0 for multiple third-party libraries including marimo, jax, unsloth, numpy, plotly, and opencv.
2025-10-13 22:19:24 -04:00
GitLab CI
159d5e2b24 chore: Regenerate all playbooks 2025-10-14 01:21:39 +00:00
GitLab CI
34239a8313 chore: Regenerate all playbooks 2025-10-14 00:40:26 +00:00