sft_train(
    model_name: str,
    dataset_name: str = None,
    hf_token: str = '',
    dataset_config_name: str = None,
    data_from_hf: bool = True,
    do_split: bool = True,
    split_ratio: float = 0.2,
    use_peft: bool = False,
    lora_config: LoraConfig = None,
    sft_config: SFTConfig = None,
    data: dict = {},
    wandb_config: wandbConfig = None,
    use_ddp: bool = False,
    use_zero: bool = True,
    sft_prompt_config: sftPromptConfig = None
)
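
A minimal call might look like the sketch below. The model and dataset identifiers are placeholders, and the import path for sft_train is an assumption, since this reference does not show it.

from your_package import sft_train  # hypothetical import path

# Minimal run: load a public dataset from Hugging Face and fine-tune with defaults.
# The model and dataset names below are placeholders, not recommendations.
sft_train(
    model_name="meta-llama/Llama-2-7b-hf",
    dataset_name="tatsu-lab/alpaca",
    do_split=True,       # hold out part of the dataset for validation
    split_ratio=0.2,     # 20% of the data goes to the validation split
)
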
model_name
string
required

The name of the model to be trained.

dataset_name
string

The name of the dataset to be used for training. Defaults to None.

hf_token
string

The Hugging Face token required for accessing private datasets or models. Defaults to an empty string.

dataset_config_name
string

The configuration name of the dataset, if applicable. Defaults to None.

data_from_hf
boolean

A flag to determine whether to load data from Hugging Face. Defaults to True.

do_split
boolean

A flag to determine whether to split the dataset into training and validation sets. Defaults to True.

split_ratio
float

The fraction of the dataset reserved for validation when do_split is True. Defaults to 0.2.

use_peft
boolean

A flag to enable Parameter-Efficient Fine-Tuning (PEFT). Defaults to False.

lora_config
LoraConfig

The LoRA (Low-Rank Adaptation) configuration, used when use_peft is True. Defaults to None.
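
If this parameter is peft's LoraConfig, as the type name suggests, a typical configuration might look like the sketch below; the target module names are model-specific assumptions.

from peft import LoraConfig

lora_config = LoraConfig(
    r=8,                                  # rank of the low-rank update matrices
    lora_alpha=16,                        # scaling factor applied to the update
    lora_dropout=0.05,
    target_modules=["q_proj", "v_proj"],  # attention projections; model-dependent
    task_type="CAUSAL_LM",
)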

sft_config
SFTConfig

The configuration for supervised fine-tuning (SFT). Defaults to None.
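
Assuming this is trl's SFTConfig, which subclasses transformers.TrainingArguments, a basic setup could be:

from trl import SFTConfig

sft_config = SFTConfig(
    output_dir="./sft_output",          # where checkpoints and logs are written
    num_train_epochs=3,
    per_device_train_batch_size=4,
    gradient_accumulation_steps=4,      # effective batch size = 4 * 4 per device
    learning_rate=2e-5,
    logging_steps=10,
)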

data
dict

A dictionary containing the training data. Defaults to an empty dictionary.
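
The expected schema is not documented here. A purely hypothetical example, assuming the dictionary maps split names to lists of records, might be:

# Hypothetical schema -- the actual expected keys depend on the library.
data = {
    "train": [
        {"prompt": "Translate to French: Hello", "completion": "Bonjour"},
    ],
    "validation": [
        {"prompt": "Translate to French: Goodbye", "completion": "Au revoir"},
    ],
}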

wandb_config
wandbConfig

The configuration for Weights & Biases (wandb) logging. Defaults to None.

use_ddp
boolean

A flag to enable Distributed Data Parallel (DDP) training. Defaults to False.

use_zero
boolean

A flag to enable ZeRO (Zero Redundancy Optimizer) for memory optimization. Defaults to True.

sft_prompt_config
sftPromptConfig

The configuration for the SFT prompts. Defaults to None.
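
Putting the pieces together, a PEFT-enabled run might look like the following sketch. As above, the import path and the model and dataset identifiers are placeholders, not part of this reference.

from peft import LoraConfig
from trl import SFTConfig
from your_package import sft_train  # hypothetical import path

sft_train(
    model_name="meta-llama/Llama-2-7b-hf",  # placeholder model identifier
    dataset_name="tatsu-lab/alpaca",        # placeholder dataset identifier
    use_peft=True,
    lora_config=LoraConfig(r=8, lora_alpha=16, task_type="CAUSAL_LM"),
    sft_config=SFTConfig(output_dir="./sft_output", num_train_epochs=1),
    use_zero=True,   # keep ZeRO memory optimization enabled (the default)
)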