
Version 1.1

We are excited to announce the release of GenAI Studio v1.1.0, featuring significant enhancements to boost your GenAI development and deployment experience.

✨ What's New?

🔧 Revamped Management Interface & Optimized Architecture

  • Entirely refactored GenAI Studio management interface

  • Enhanced architecture to better support Inference Apps such as Flux.1 Schnell (Text to Image) and ScrapeGraphAI

  • Easier management and streamlined deployment of diverse GenAI applications

🔄 RAGOps Auto-Sync

  • Automatic document synchronization from designated folders directly to your vector databases

  • Significantly improved RAG (Retrieval-Augmented Generation) workflow efficiency
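One way to picture folder-to-vector-database synchronization is as a change-detection pass: hash each file in the watched folder, compare against the hashes recorded on the previous pass, and re-index only what differs. The sketch below is illustrative and is not GenAI Studio's implementation. `plan_sync`, the state dictionary, and the action names are hypothetical, and the embedding and upsert steps against the vector database are left out.

```python
import hashlib
from pathlib import Path


def file_digest(path: Path) -> str:
    """Return the SHA-256 hex digest of a file's contents."""
    return hashlib.sha256(path.read_bytes()).hexdigest()


def plan_sync(folder: Path, previous: dict[str, str]) -> tuple[dict[str, list[str]], dict[str, str]]:
    """Compare a folder against the hashes recorded on the last pass.

    Hypothetical helper: returns (actions, current_state), where actions
    lists the relative paths to add, update, or delete in the vector database.
    """
    current = {
        p.relative_to(folder).as_posix(): file_digest(p)
        for p in sorted(folder.rglob("*"))
        if p.is_file()
    }
    actions = {
        "add": [name for name in current if name not in previous],
        "update": [
            name for name in current
            if name in previous and previous[name] != current[name]
        ],
        "delete": [name for name in previous if name not in current],
    }
    return actions, current
```

A caller would persist the returned state between passes and hand each action list to whatever component embeds documents and writes them to the vector database.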

📊 Real-time System Monitoring

  • Integrated Grafana and Prometheus for real-time system performance tracking

  • Proactively detect and address potential issues before they escalate
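Prometheus collects metrics by scraping exporters that publish them in a plain-text exposition format. As a rough illustration of what that data looks like and how a threshold check might work, the sketch below parses a made-up sample and flags values above a limit. The parser is simplified (it assumes no escaped quotes in label values and no trailing timestamps), the sample values are invented, and `parse_metrics` and `over_threshold` are hypothetical helpers, not part of GenAI Studio.

```python
import re

# Simplified pattern for one sample line: name, optional {labels}, value.
# Assumption: no escaped quotes inside label values and no trailing timestamp.
SAMPLE_LINE = re.compile(r'^([A-Za-z_:][A-Za-z0-9_:]*)(?:\{(.*)\})?\s+(\S+)$')
LABEL = re.compile(r'([A-Za-z_][A-Za-z0-9_]*)="([^"]*)"')


def parse_metrics(text: str) -> list[tuple[str, dict[str, str], float]]:
    """Parse exposition-format text into (name, labels, value) tuples."""
    samples = []
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):  # skip blanks, HELP and TYPE lines
            continue
        match = SAMPLE_LINE.match(line)
        if match is None:
            continue
        name, raw_labels, value = match.groups()
        labels = dict(LABEL.findall(raw_labels or ""))
        samples.append((name, labels, float(value)))
    return samples


def over_threshold(samples, metric: str, limit: float) -> list[dict[str, str]]:
    """Return the label sets whose value for `metric` exceeds `limit`."""
    return [
        labels for name, labels, value in samples
        if name == metric and value > limit
    ]


# Made-up sample in the style of exporter output.
EXAMPLE = """\
# HELP DCGM_FI_DEV_GPU_UTIL GPU utilization (in %).
# TYPE DCGM_FI_DEV_GPU_UTIL gauge
DCGM_FI_DEV_GPU_UTIL{gpu="0"} 97
DCGM_FI_DEV_GPU_UTIL{gpu="1"} 12
node_memory_MemAvailable_bytes 8.0e+09
"""

if __name__ == "__main__":
    # Flag GPUs whose utilization is above 90 percent in the sample.
    print(over_threshold(parse_metrics(EXAMPLE), "DCGM_FI_DEV_GPU_UTIL", 90))
```

In practice this kind of check is usually expressed as a Prometheus alerting rule or a Grafana alert rather than hand-written parsing; the code only shows the shape of the data involved.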

🚀 Model Conversion & Inference Runtime

  • Added model conversion, and made the EdgeAI SDK available for download and deployment on the inference side, making it easier to deploy models to edge devices.

🎯 Enhanced Fine-Tuning with LoRA

  • Integrated LoRA (Low-Rank Adaptation) fine-tuning support via Unsloth
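To give a sense of why low-rank adaptation is cheaper than full-parameter fine-tuning: instead of updating a whole d x k weight matrix, it trains two small matrices, B (d x r) and A (r x k), so the trainable parameter count for that layer drops from d*k to r*(d + k). The arithmetic below is a minimal sketch; the layer size and rank are illustrative assumptions, not GenAI Studio or Unsloth defaults.

```python
def full_params(d: int, k: int) -> int:
    """Trainable parameters when the whole d x k weight matrix is updated."""
    return d * k


def lora_params(d: int, k: int, r: int) -> int:
    """Trainable parameters for low-rank adaptation: B is d x r, A is r x k."""
    return r * (d + k)


if __name__ == "__main__":
    d = k = 4096  # illustrative layer size (assumption)
    r = 16        # illustrative rank (assumption)
    full = full_params(d, k)
    low_rank = lora_params(d, k, r)
    print(f"full: {full:,}  low-rank: {low_rank:,}  fraction: {low_rank / full:.2%}")
```

With these illustrative numbers, the low-rank update trains 131,072 parameters for the layer instead of 16,777,216.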

⚙️ Upgrades & Maintenance

  • AnythingLLM updated to v1.7.5, providing the latest features and improved security.

  • Phison Firmware upgraded to NXUN202.00, enhancing hardware performance and stability.

Third-Party Updates

GenAI Studio utilizes the following components:

  • node-exporter (1.8.2) Exposes desired host metrics to Prometheus.

  • dcgm-exporter (4.0.0-4.0.1-ubuntu22.04) Exposes host GPU metrics to Prometheus.

  • Prometheus (3.1.0) Collects desired metrics as a data source for Grafana.

  • Grafana (11.4.0) Serves as the resource monitoring dashboard.

  • Phison aiDAPTIVLink (NXUN202.00) Serves as the middleware for full-parameter model fine-tuning.

  • Ollama (0.6.2) Acts as the inference server.

  • llama.cpp (full-cuda-b4897) Converts model files to the GGUF format.

  • vsFTP (3.0.5) Serves model files for download.

  • Qdrant (1.12.4) Functions as the vector database.

  • Flowise (2.2.7-patch.1) Automates RAGOps functionality and workflows.

  • PostgreSQL (16.4) Serves as the relational database.

  • Unsloth (2025.3.18) Performs model fine-tuning in LoRA mode.

Compatible with inference models like DeepSeek for precise, customized adjustments
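Ollama, listed above as the inference server, exposes an HTTP API. The sketch below builds (but does not send) a request to its `/api/generate` endpoint using only the standard library. The base URL uses Ollama's default port and is an assumption about where the server is reachable in a given GenAI Studio deployment; the model tag is likewise only an example, and `build_generate_request` is a hypothetical helper.

```python
import json
import urllib.request


def build_generate_request(
    prompt: str,
    model: str,
    base_url: str = "http://localhost:11434",  # Ollama's default port (assumed address)
) -> urllib.request.Request:
    """Build a non-streaming POST request for Ollama's /api/generate endpoint."""
    payload = {"model": model, "prompt": prompt, "stream": False}
    return urllib.request.Request(
        url=f"{base_url}/api/generate",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


if __name__ == "__main__":
    # "deepseek-r1" is an example model tag; use whichever model is installed.
    req = build_generate_request("Summarize LoRA in one sentence.", model="deepseek-r1")
    print(req.full_url)
    # Sending requires a running Ollama server with the model pulled:
    # with urllib.request.urlopen(req) as resp:
    #     print(json.loads(resp.read())["response"])
```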