GPU Resource
Last updated
Efficient GPU usage is critical for cost-effectiveness. GenAI Studio provides the following GPU resource management modes:
As-Is Mode
The system imposes no restrictions on GPU resources. In this mode, GPU resources are typically allocated on a first-come, first-served basis.
Training Dedicated Mode
The system restricts GPU resources to model training and related processes, such as validation and quantization. If a training run is expected to take more than 12 hours, switch to this mode before starting it so that the run is not forcibly interrupted before it produces results.
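The 12-hour guideline can be expressed as a simple pre-flight check. The following is a minimal sketch only: the function name and the idea of passing an estimated duration are assumptions for illustration and are not part of GenAI Studio.

```python
from datetime import timedelta

# Threshold taken from the guideline above: runs expected to exceed 12 hours.
LONG_RUN_THRESHOLD = timedelta(hours=12)


def should_use_training_dedicated(expected_duration: timedelta) -> bool:
    """Return True when the estimated run is longer than the 12-hour guideline.

    Hypothetical helper; GenAI Studio does not expose this function.
    """
    return expected_duration > LONG_RUN_THRESHOLD
```

For example, an estimated 20-hour run would return True, suggesting a switch to Training Dedicated Mode before the run starts.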
Time-Slot Mode
The system assigns GPU resources according to scheduled time slots, so that they serve either chat-related functions or model training-related functions. Adjust the following settings to match your organization's needs and specify the time slots during which GPU resources are reserved for chat-related functions. All time outside those slots is used for model training-related functions.
Note: When a chat-related time slot begins, the system forcibly interrupts any model training-related work that is still running.
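The time-slot rule can be illustrated with a short sketch. It assumes chat slots are configured as same-day start and end times, with the start inclusive and the end exclusive. The slot values, function names, and boundary handling are hypothetical and do not reflect GenAI Studio's actual settings or implementation.

```python
from datetime import time

# Hypothetical chat time slots (start inclusive, end exclusive).
# These are illustrative values, not GenAI Studio defaults.
CHAT_SLOTS = [(time(9, 0), time(12, 0)), (time(13, 0), time(18, 0))]


def gpu_purpose(now, chat_slots=CHAT_SLOTS):
    """Return "chat" if `now` falls inside any chat slot, otherwise "training"."""
    for start, end in chat_slots:
        if start <= now < end:
            return "chat"
    return "training"


def training_would_be_interrupted(start, end, chat_slots=CHAT_SLOTS):
    """Return True if a same-day training run from `start` to `end` overlaps a chat slot."""
    return any(
        start < slot_end and slot_start < end
        for slot_start, slot_end in chat_slots
    )
```

Under these assumed slots, a training run planned from 07:00 to 10:00 overlaps the 09:00 chat slot and would be interrupted, whereas a run from 19:00 to 23:00 would not.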