Version 1.1

Last updated 17 days ago

We are excited to announce the release of GenAI Studio v1.1.0, featuring significant enhancements to boost your GenAI development and deployment experience.

✨ What's New?

🔧 Revamped Management Interface & Optimized Architecture

Fully refactored the GenAI Studio management interface
Enhanced architecture to better support Inference Apps such as Flux.1 Schnell (Text to Image) and ScrapeGraphAI
Easier management and streamlined deployment of diverse GenAI applications

🔄 RAGOps Auto-Sync

Automatic document synchronization from designated folders directly to your vector databases
Significantly improved RAG (Retrieval-Augmented Generation) workflow efficiency
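
To illustrate the idea behind folder-to-vector-database synchronization, here is a minimal sketch of the change-detection step: hash every file in a watched folder, compare against the previous snapshot, and report which documents would need re-indexing. This is not GenAI Studio's implementation; the function names `snapshot` and `diff_snapshots` are hypothetical, and the actual upload to a vector database is left out.

```python
import hashlib
from pathlib import Path


def snapshot(folder: Path) -> dict[str, str]:
    """Map each file's path (relative to folder) to the SHA-256 of its contents."""
    digests = {}
    for path in sorted(folder.rglob("*")):
        if path.is_file():
            key = path.relative_to(folder).as_posix()
            digests[key] = hashlib.sha256(path.read_bytes()).hexdigest()
    return digests


def diff_snapshots(old: dict[str, str], new: dict[str, str]) -> dict[str, list[str]]:
    """Classify paths as added, modified, or removed between two snapshots."""
    return {
        "added": sorted(p for p in new if p not in old),
        "modified": sorted(p for p in new if p in old and new[p] != old[p]),
        "removed": sorted(p for p in old if p not in new),
    }
```

A sync loop built on this would re-embed the "added" and "modified" documents and delete the vectors belonging to "removed" ones. Hashing file contents rather than comparing timestamps avoids re-indexing files that were touched but not changed.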

📊 Real-time System Monitoring

Integrated Grafana and Prometheus for real-time system performance tracking
Proactively detect and address potential issues before they escalate
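
Metrics collected by Prometheus can also be read programmatically through its HTTP query API. The sketch below builds an instant-query URL and parses the JSON response into per-GPU values. It makes several assumptions: Prometheus is reachable at its default address `http://localhost:9090` (your GenAI Studio deployment may expose it elsewhere), the metric queried is dcgm-exporter's `DCGM_FI_DEV_GPU_UTIL`, and samples carry a `gpu` label. The helper names are hypothetical.

```python
import json
from urllib.parse import urlencode

# Assumption: Prometheus default address; adjust for your deployment.
PROMETHEUS_URL = "http://localhost:9090"


def instant_query_url(promql: str, base_url: str = PROMETHEUS_URL) -> str:
    """Build the URL for a Prometheus instant query (GET /api/v1/query)."""
    return f"{base_url}/api/v1/query?{urlencode({'query': promql})}"


def parse_instant_vector(body: str, label: str = "gpu") -> dict[str, float]:
    """Turn an instant-vector response into {label value: sample value}."""
    payload = json.loads(body)
    if payload.get("status") != "success":
        raise ValueError(f"query failed: {payload.get('error', 'unknown error')}")
    values = {}
    for sample in payload["data"]["result"]:
        # Each sample is {"metric": {labels...}, "value": [timestamp, "value"]}.
        _timestamp, raw_value = sample["value"]
        values[sample["metric"].get(label, "unknown")] = float(raw_value)
    return values
```

To use it, fetch `instant_query_url("DCGM_FI_DEV_GPU_UTIL")` with any HTTP client, for example `urllib.request.urlopen`, and pass the response text to `parse_instant_vector`.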

🚀 Model Conversion & Inference Runtime

Added model conversion, and made the EdgeAI SDK available for download and deployment on the inference side. Together, these make it easier to deploy models to edge devices.

🎯 Enhanced Fine-Tuning with LoRA

Integrated LoRA (Low-Rank Adaptation) fine-tuning support via Unsloth
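
For readers unfamiliar with Low-Rank Adaptation, the arithmetic below shows why it is cheaper than full fine-tuning: instead of updating a full weight matrix of shape d_out × d_in, it trains two small matrices of shapes d_out × r and r × d_in. This is a generic illustration, not Unsloth's API; the layer size and rank are example values chosen here.

```python
def full_params(d_out: int, d_in: int) -> int:
    """Trainable parameters when the whole weight matrix is updated."""
    return d_out * d_in


def lora_params(d_out: int, d_in: int, rank: int) -> int:
    """Trainable parameters for the two low-rank factors (d_out x r and r x d_in)."""
    return rank * (d_out + d_in)


# Example values (assumptions): one 4096 x 4096 projection layer, rank 16.
full = full_params(4096, 4096)      # 16,777,216
lora = lora_params(4096, 4096, 16)  # 131,072
print(f"LoRA trains {lora / full:.2%} of the layer's parameters")
```

With these example values, the low-rank factors hold under 1% of the parameters of the original layer.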

⚙️ Upgrades & Maintenance

AnythingLLM updated to v1.7.5, providing the latest features and improved security.
Phison Firmware upgraded to NXUN202.00, enhancing hardware performance and stability.

Third-Party Updates

GenAI Studio utilizes the following components:

node-exporter (1.8.2): Exposes host metrics to Prometheus.
dcgm-exporter (4.0.0-4.0.1-ubuntu22.04): Exposes host GPU metrics to Prometheus.
Prometheus (3.1.0): Collects metrics and serves as the data source for Grafana.
Grafana (11.4.0): Serves as the resource monitoring dashboard.
Phison aiDAPTIVLink (NXUN202.00): Middleware for full-parameter model fine-tuning.
Ollama (0.6.2): Acts as the inference server.
llama.cpp (full-cuda-b4897): Converts models to the GGUF file format.
vsFTP (3.0.5): Serves model files for download.
Qdrant (1.12.4): Functions as the vector database.
Flowise (2.2.7-patch.1): Automates RAGOps functionality and workflows.
PostgreSQL (16.4): Serves as the relational database.
Unsloth (2025.3.18): Performs model fine-tuning in LoRA mode.
