Components
Finetune
lazyllm.components.finetune.AlpacaloraFinetune
Bases: LazyLLMFinetuneBase
This class is a subclass of LazyLLMFinetuneBase, based on the LoRA fine-tuning capabilities provided by the alpaca-lora project, used for LoRA fine-tuning of large language models.
Parameters:
-
base_model(str) –Path to the base model for fine-tuning.
-
target_path(str) –Path to save LoRA weights of the fine-tuned model.
-
merge_path(Optional[str], default:None) –Path to save merged LoRA weights, default
None. If not provided, "lazyllm_lora" and "lazyllm_merge" directories are created undertarget_path. -
model_name(Optional[str], default:'LLM') –Model name used as log prefix, default "LLM".
-
cp_files(Optional[str], default:'tokeniz*') –Configuration files copied from base model path to
merge_path, defaulttokeniz*. -
launcher(launcher, default:remote(ngpus=1)) –Launcher for fine-tuning, default
launchers.remote(ngpus=1). -
kw(dict, default:{}) –Keyword arguments to update default training parameters:
Other Parameters:
-
data_path(Optional[str]) –Path to dataset, default
None. -
batch_size(Optional[int]) –Batch size, default 64.
-
micro_batch_size(Optional[int]) –Micro-batch size, default 4.
-
num_epochs(Optional[int]) –Number of training epochs, default 2.
-
learning_rate(Optional[float]) –Learning rate, default 5.e-4.
-
cutoff_len(Optional[int]) –Cutoff length, default 1030.
-
filter_nums(Optional[int]) –Number of filters, default 1024.
-
val_set_size(Optional[int]) –Validation set size, default 200.
-
lora_r(Optional[int]) –LoRA rank, default 8.
-
lora_alpha(Optional[int]) –LoRA fusion factor, default 32.
-
lora_dropout(Optional[float]) –LoRA dropout rate, default 0.05.
-
lora_target_modules(Optional[str]) –LoRA target modules, default
[wo,wqkv]. -
modules_to_save(Optional[str]) –Modules for full fine-tuning, default
[tok_embeddings,output]. -
deepspeed(Optional[str]) –Path to DeepSpeed config, default uses repository pre-made ds.json.
-
prompt_template_name(Optional[str]) –Name of prompt template, default "alpaca".
-
train_on_inputs(Optional[bool]) –Whether to train on inputs, default
True. -
show_prompt(Optional[bool]) –Whether to show the prompt, default
False. -
nccl_port(Optional[int]) –NCCL port, default random between 19000-20500.
Examples:
>>> from lazyllm import finetune
>>> trainer = finetune.alpacalora('path/to/base/model', 'path/to/target')
Source code in lazyllm/components/finetune/alpacalora.py
10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132 133 134 135 136 137 138 139 140 141 142 143 144 145 | |
cmd(trainset, valset=None)
Generate shell command sequence for Alpaca-LoRA fine-tuning and model merging.
Parameters:
-
trainset(str) –Training dataset path, supports both relative path (to configured data_path) and absolute path
-
valset(str, default:None) –Validation dataset path, will auto-split from trainset if not specified
Returns:
- str or list: Returns a single command string when no merging needed, otherwise returns a list containing: [fine-tune command, merge command, file copy command]
Examples:
>>> from lazyllm import finetune
>>> trainer = finetune.alpacalora('path/to/base/model', 'path/to/target')
>>> cmd = trainer.cmd("my_dataset.json")
Source code in lazyllm/components/finetune/alpacalora.py
lazyllm.components.finetune.CollieFinetune
Bases: LazyLLMFinetuneBase
This class is a subclass of LazyLLMFinetuneBase, based on the LoRA fine-tuning capabilities provided by the Collie framework, used for LoRA fine-tuning of large language models.
Parameters:
-
base_model(str) –Path to the base model for fine-tuning.
-
target_path(str) –Path to save LoRA weights of the fine-tuned model.
-
merge_path(Optional[str], default:None) –Path to save merged LoRA weights, default
None. If not provided, "lazyllm_lora" and "lazyllm_merge" directories are created undertarget_path. -
model_name(Optional[str], default:'LLM') –Model name used as log prefix, default "LLM".
-
cp_files(Optional[str], default:'tokeniz*') –Configuration files copied from base model path to
merge_path, default "tokeniz*". -
launcher(launcher, default:remote(ngpus=1)) –Launcher for fine-tuning, default
launchers.remote(ngpus=1). -
kw(dict, default:{}) –Keyword arguments to update default training parameters:
Other Parameters:
-
data_path(Optional[str]) –Path to dataset, default
None. -
batch_size(Optional[int]) –Batch size, default 64.
-
micro_batch_size(Optional[int]) –Micro-batch size, default 4.
-
num_epochs(Optional[int]) –Number of training epochs, default 3.
-
learning_rate(Optional[float]) –Learning rate, default 5.e-4.
-
dp_size(Optional[int]) –Data parallelism parameter, default 8.
-
pp_size(Optional[int]) –Pipeline parallelism parameter, default 1.
-
tp_size(Optional[int]) –Tensor parallelism parameter, default 1.
-
lora_r(Optional[int]) –LoRA rank, default 8.
-
lora_alpha(Optional[int]) –LoRA fusion factor, default 16.
-
lora_dropout(Optional[float]) –LoRA dropout rate, default 0.05.
-
lora_target_modules(Optional[str]) –LoRA target modules, default
[wo,wqkv]. -
modules_to_save(Optional[str]) –Modules for full fine-tuning, default
[tok_embeddings,output]. -
prompt_template_name(Optional[str]) –Name of prompt template, default "alpaca".
Examples:
>>> from lazyllm import finetune
>>> trainer = finetune.collie('path/to/base/model', 'path/to/target')
Source code in lazyllm/components/finetune/collie.py
9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 | |
lazyllm.components.finetune.LlamafactoryFinetune
Bases: LazyLLMFinetuneBase
This class is a subclass of LazyLLMFinetuneBase, based on the training capabilities provided by the LLaMA-Factory framework, used for training large language models(or visual language models).
Parameters:
-
base_model–Path to the base model used for training. Supports local paths; if the path does not exist, it will attempt to locate it from the configured model directory.
-
target_path–Target directory to save model weights after training is completed.
-
merge_path(str, default:None) –Path to save the model after merging LoRA weights. Defaults to None. If not specified, two directories will be automatically created under
target_path: - "lazyllm_lora" (for storing LoRA fine-tuned weights) - "lazyllm_merge" (for storing the merged model weights) -
config_path(str, default:None) –Path to the YAML file containing training configuration. Defaults to None. If not specified, the default config file
llama_factory/sft.yamlwill be used. This file can override default training parameters. -
export_config_path(str, default:None) –Path to the YAML file for LoRA weight export/merging configuration. Defaults to None. If not specified, the default config file
llama_factory/lora_export.yamlwill be used. -
lora_r(int, default:None) –Rank of the LoRA adaptation. If provided, overrides the
lora_rankvalue in the configuration. -
modules_to_save(str, default:None) –List of additional module names to be saved. Should be provided as a string in Python list format, e.g., "[module1, module2]".
-
lora_target_modules(str, default:None) –List of module names to apply LoRA fine-tuning to. Format is the same as above.
-
launcher(launcher, default:remote(ngpus=1, sync=True)) –Launcher for the fine-tuning task. Defaults to a single-GPU, synchronous remote launcher:
launchers.remote(ngpus=1, sync=True). -
**kw–Additional keyword arguments used to dynamically override default parameters in the training configuration.
Other Parameters:
-
stage(Literal['pt', 'sft', 'rm', 'ppo', 'dpo', 'kto']) –Default is:
sft. Which stage will be performed in training. -
do_train(bool) –Default is:
True. Whether to run training. -
finetuning_type(Literal['lora', 'freeze', 'full']) –Default is:
lora. Which fine-tuning method to use. -
lora_target(str) –Default is:
all. Name(s) of target modules to apply LoRA. Use commas to separate multiple modules. Useallto specify all the linear modules. -
template(Optional[str]) –Default is:
None. Which template to use for constructing prompts in training and inference. -
cutoff_len(int) –Default is:
1024. The cutoff length of the tokenized inputs in the dataset. -
max_samples(Optional[int]) –Default is:
1000. For debugging purposes, truncate the number of examples for each dataset. -
overwrite_cache(bool) –Default is:
True. Overwrite the cached training and evaluation sets. -
preprocessing_num_workers(Optional[int]) –Default is:
16. The number of processes to use for the pre-processing. -
dataset_dir(str) –Default is:
lazyllm_temp_dir. Path to the folder containing the datasets. If not explicitly specified, LazyLLM will generate adataset_info.jsonfile in the.tempfolder in the current working directory for use by LLaMA-Factory. -
logging_steps(float) –Default is:
10. Log every X updates steps. Should be an integer or a float in range[0,1). If smaller than 1, will be interpreted as ratio of total training steps. -
save_steps(float) –Default is:
500. Save checkpoint every X updates steps. Should be an integer or a float in range[0,1). If smaller than 1, will be interpreted as ratio of total training steps. -
plot_loss(bool) –Default is:
True. Whether or not to save the training loss curves. -
overwrite_output_dir(bool) –Default is:
True. Overwrite the content of the output directory. -
per_device_train_batch_size(int) –Default is:
1. Batch size per GPU/TPU/MPS/NPU core/CPU for training. -
gradient_accumulation_steps(int) –Default is:
8. Number of updates steps to accumulate before performing a backward/update pass. -
learning_rate(float) –Default is:
1e-04. The initial learning rate for AdamW. -
num_train_epochs(float) –Default is:
3.0. Total number of training epochs to perform. -
lr_scheduler_type(Union[SchedulerType, str]) –Default is:
cosine. The scheduler type to use. -
warmup_ratio(float) –Default is:
0.1. Linear warmup over warmup_ratio fraction of total steps. -
fp16(bool) –Default is:
True. Whether to use fp16 (mixed) precision instead of 32-bit. -
ddp_timeout(Optional[int]) –Default is:
180000000. Overrides the default timeout for distributed training (value should be given in seconds). -
report_to(Union[NoneType, str, List[str]]) –Default is:
tensorboard. The list of integrations to report the results and logs to. -
val_size(float) –Default is:
0.1. Size of the development set, should be an integer or a float in range[0,1). -
per_device_eval_batch_size(int) –Default is:
1. Batch size per GPU/TPU/MPS/NPU core/CPU for evaluation. -
eval_strategy(Union[IntervalStrategy, str]) –Default is:
steps. The evaluation strategy to use. -
eval_steps(Optional[float]) –Default is:
500. Run an evaluation every X steps. Should be an integer or a float in range[0,1). If smaller than 1, will be interpreted as ratio of total training steps.
Examples:
>>> from lazyllm import finetune
>>> trainer = finetune.llamafactory('internlm2-chat-7b', 'path/to/target')
<lazyllm.llm.finetune type=LlamafactoryFinetune>
Source code in lazyllm/components/finetune/llamafactory.py
15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132 133 134 135 136 137 138 139 140 141 142 143 144 145 146 147 148 149 150 151 152 153 154 155 156 157 158 159 160 161 162 163 164 165 166 167 168 169 170 171 172 173 174 175 176 177 178 179 180 181 182 183 184 185 186 187 188 189 190 191 192 193 194 195 196 197 198 199 200 201 202 203 204 205 206 207 208 209 210 211 212 213 214 215 216 217 218 219 220 221 222 223 224 225 226 227 228 229 230 231 232 233 234 235 236 237 238 239 240 241 242 243 244 245 246 247 248 249 250 251 252 253 254 255 256 257 258 259 260 261 262 263 264 265 266 267 268 269 270 271 272 273 274 275 276 277 278 279 280 281 282 283 284 285 286 287 288 289 290 291 292 293 294 295 296 297 298 299 300 301 302 303 304 305 306 307 308 309 310 311 312 313 314 315 316 317 318 319 320 321 322 323 324 325 326 327 328 329 330 331 332 333 334 335 336 337 338 339 340 341 342 343 344 345 346 347 348 349 350 351 352 353 354 355 356 357 358 359 360 361 362 363 364 365 366 367 368 369 370 371 372 373 374 375 376 377 378 379 380 381 382 383 384 385 386 387 388 389 390 391 392 393 394 395 396 397 398 399 400 401 402 403 404 405 406 407 408 409 410 411 412 413 414 415 416 417 418 419 420 421 422 423 424 425 426 427 428 429 430 431 432 433 434 435 436 437 438 439 440 441 442 443 444 445 446 447 448 449 450 451 452 453 454 455 456 457 458 459 460 461 462 463 464 465 466 467 468 469 470 471 472 473 474 475 476 477 478 479 480 481 482 483 484 485 486 487 488 489 490 491 492 493 494 495 496 497 498 499 500 | |
cmd(trainset, valset=None)
Generate LLaMA-Factory fine-tuning command sequence, including training and model merge commands.
Parameters:
-
trainset(str) –Training dataset path (supports relative path to lazyllm.config['data_path'])
-
valset(str, default:None) –Validation dataset path (not directly used in current implementation)
Returns:
- str: Complete shell command string containing:
- Training command (with auto-configured parameters)
- Log redirection (saved to target path)
- Optional model merge command (when LoRA is configured)
Notes
- Automatically generates timestamped training log files
- Temporary files are automatically cleaned up after use
- Supports multiple data formats (alpaca/sharegpt etc.)
- Multimodal data (images/videos/audios) is automatically detected and handled
Source code in lazyllm/components/finetune/llamafactory.py
lazyllm.components.deploy.LazyLLMDeployBase
Bases: ComponentBase
This class is a subclass of ComponentBase that provides basic functionality for LazyLLM deployment. It supports encoding conversion for various media types and provides configuration options for result extraction and streaming processing.
Parameters:
-
launcher(LauncherBase, default:remote()) –Launcher instance for deployment, defaults to remote launcher (
launchers.remote()).
Notes
- Need to implement specific deployment logic when inheriting this class
- Can customize result extraction logic by overriding the extract_result method
Examples:
>>> import lazyllm
>>> from lazyllm.components.deploy.base import LazyLLMDeployBase
>>> class MyDeployer(LazyLLMDeployBase):
... def __call__(self, inputs):
... return processed_result
def extract_result(output, inputs):
... return output.json()['result']
>>> deployer = MyDeployer()
>>> result = deployer.extract_result(raw_output, input_data)
Source code in lazyllm/components/deploy/base.py
extract_result(output, inputs)
staticmethod
Extract final result from model output. The default implementation returns raw output directly, subclasses can override this method to implement custom result extraction logic.
Parameters:
-
output–Raw model output
-
inputs–Original input data, can be used for post-processing
Returns:
- Processed final result
Source code in lazyllm/components/deploy/base.py
lazyllm.components.deploy.LazyLLMDeployBase.extract_result(output, inputs)
staticmethod
Extract final result from model output. The default implementation returns raw output directly, subclasses can override this method to implement custom result extraction logic.
Parameters:
-
output–Raw model output
-
inputs–Original input data, can be used for post-processing
Returns:
- Processed final result
Source code in lazyllm/components/deploy/base.py
lazyllm.components.finetune.FlagembeddingFinetune
Bases: LazyLLMFinetuneBase
This class is a subclass of LazyLLMFinetuneBase, based on the training capabilities provided by the FlagEmbedding framework, used for training embedding and reranker models.
Parameters:
-
base_model(str) –The base model used for training. It is required to be the path of the base model.
-
target_path(str) –The path where the trained model weights are saved.
-
launcher(launcher, default:remote(ngpus=1, sync=True)) –The launcher for fine-tuning, default is
launchers.remote(ngpus=1, sync=True). -
kw–Keyword arguments used to update the default training parameters.
The keyword arguments and their default values for this class of embedding model are as follows:
Other Parameters:
-
train_group_size(int) –Default is:
8. The size of train group. It is used to control the number of negative samples in each training set. -
query_max_len(int) –Default is:
512. The maximum total input sequence length after tokenization for passage. Sequences longer than this will be truncated, sequences shorter will be padded. -
passage_max_len(int) –Default is:
512. The maximum total input sequence length after tokenization for passage. Sequences longer than this will be truncated, sequences shorter will be padded. -
pad_to_multiple_of(int) –Default is:
8. If set will pad the sequence to be a multiple of the provided value. -
query_instruction_for_retrieval(str) –Default is:
Represent this sentence for searching relevant passages:. Instruction for query. -
query_instruction_format(str) –Default is:
{}{}. Format for query instruction. -
learning_rate(float) –Default is:
1e-5. Learning rate. -
num_train_epochs(int) –Default is:
1. Total number of training epochs to perform. -
per_device_train_batch_size(int) –Default is:
2. Train batch size -
gradient_accumulation_steps(int) –Default is:
1. Number of updates steps to accumulate before performing a backward/update pass. -
dataloader_drop_last(bool) –Default is:
True. When it='True', the last incomplete batch is dropped if the dataset size is not divisible by the batch size, meaning DataLoader only returns complete batches. -
warmup_ratio(float) –Default is:
0.1. Warmup ratio for linear scheduler. -
weight_decay(float) –Default is:
0.01. Weight decay in AdamW. -
deepspeed(str) –Default is:
``. The path of the DeepSpeed configuration file, default to use the pre-made configuration file in the LazyLLM code repository:ds_stage0.json``. -
logging_steps(int) –Default is:
1. Logging frequency according to logging strategy. -
save_steps(int) –Default is:
1000. Saving frequency. -
temperature(float) –Default is:
0.02. Temperature used for similarity score -
sentence_pooling_method(str) –Default is:
cls. The pooling method. Available options: 'cls', 'mean', 'last_token'. -
normalize_embeddings(bool) –Default is:
True. Whether to normalize the embeddings. -
kd_loss_type(str) –Default is:
kl_div. The loss type for knowledge distillation. Available options:'kl_div', 'm3_kd_loss'. -
overwrite_output_dir(bool) –Default is:
True. It is used to allow the program to overwrite an existing output directory. -
fp16(bool) –Default is:
True. Whether to use fp16 (mixed) precision instead of 32-bit. -
gradient_checkpointing(bool) –Default is:
True. Whether enable gradient checkpointing. -
negatives_cross_device(bool) –Default is:
True. Whether share negatives across devices.
The keyword arguments and their default values for this class of reranker model are as follows:
Other Parameters:
-
train_group_size(int) –Default is:
8. The size of train group. It is used to control the number of negative samples in each training set. -
query_max_len(int) –Default is:
256. The maximum total input sequence length after tokenization for passage. Sequences longer than this will be truncated, sequences shorter will be padded. -
passage_max_len(int) –Default is:
256. The maximum total input sequence length after tokenization for passage. Sequences longer than this will be truncated, sequences shorter will be padded. -
pad_to_multiple_of(int) –Default is:
8. If set will pad the sequence to be a multiple of the provided value. -
learning_rate(float) –Default is:
6e-5. Learning rate. -
num_train_epochs(int) –Default is:
1. Total number of training epochs to perform. -
per_device_train_batch_size(int) –Default is:
2. Train batch size -
gradient_accumulation_steps(int) –Default is:
1. Number of updates steps to accumulate before performing a backward/update pass. -
dataloader_drop_last(bool) –Default is:
True. When it='True', the last incomplete batch is dropped if the dataset size is not divisible by the batch size, meaning DataLoader only returns complete batches. -
warmup_ratio(float) –Default is:
0.1. Warmup ratio for linear scheduler. -
weight_decay(float) –Default is:
0.01. Weight decay in AdamW. -
deepspeed(str) –Default is:
``. The path of the DeepSpeed configuration file, default to use the pre-made configuration file in the LazyLLM code repository:ds_stage0.json``. -
logging_steps(int) –Default is:
1. Logging frequency according to logging strategy. -
save_steps(int) –Default is:
1000. Saving frequency. -
overwrite_output_dir(bool) –Default is:
True. It is used to allow the program to overwrite an existing output directory. -
fp16(bool) –Default is:
True. Whether to use fp16 (mixed) precision instead of 32-bit. -
gradient_checkpointing(bool) –Default is:
True. Whether enable gradient checkpointing.
Examples:
>>> from lazyllm import finetune
>>> finetune.FlagembeddingFinetune('bge-m3', 'path/to/target')
<lazyllm.llm.finetune type=FlagembeddingFinetune>
Source code in lazyllm/components/finetune/flagembedding.py
12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132 133 134 135 136 137 138 139 140 141 142 143 144 145 146 147 148 149 150 151 152 153 154 155 156 157 158 159 160 161 162 163 164 165 166 167 168 169 170 171 172 173 174 175 176 177 178 179 180 181 182 183 | |
lazyllm.components.auto.AutoFinetune
Bases: LazyLLMFinetuneBase
This class is a subclass of LazyLLMFinetuneBase and can automatically select the appropriate fine-tuning framework and parameters based on the input arguments to fine-tune large language models.
Specifically, based on the input model parameters of base_model, ctx_len, batch_size, lora_r, the type and number of GPUs in launcher, this class can automatically select the appropriate fine-tuning framework (such as: AlpacaloraFinetune or CollieFinetune) and the required parameters.
Parameters:
-
base_model(str) –The base model used for fine-tuning. It is required to be the path of the base model.
-
source(config[model_source]) –Specifies the model download source. This can be configured by setting the environment variable
LAZYLLM_MODEL_SOURCE. -
target_path(str) –The path where the LoRA weights of the fine-tuned model are saved.
-
merge_path(str) –The path where the model merges the LoRA weights, default to
None. If not specified, "lazyllm_lora" and "lazyllm_merge" directories will be created undertarget_pathastarget_pathandmerge_pathrespectively. -
ctx_len(int) –The maximum token length for input to the fine-tuned model, default to
1024. -
batch_size(int) –Batch size, default to
32. -
lora_r(int) –LoRA rank, default to
8; this value determines the amount of parameters added, the smaller the value, the fewer the parameters. -
launcher(launcher, default:remote()) –The launcher for fine-tuning, default to
launchers.remote(ngpus=1). -
kw–Keyword arguments, used to update the default training parameters. Note that additional keyword arguments cannot be arbitrarily specified, as they depend on the framework inferred by LazyLLM, so it is recommended to set them with caution.
Examples:
>>> from lazyllm import finetune
>>> finetune.auto("internlm2-chat-7b", 'path/to/target')
<lazyllm.llm.finetune type=AlpacaloraFinetune>
Source code in lazyllm/components/auto/autofinetune.py
lazyllm.components.finetune.base.DummyFinetune
Bases: LazyLLMFinetuneBase
DummyFinetune is a subclass of LazyLLMFinetuneBase that serves as a placeholder implementation for fine-tuning. The class is primarily used for demonstration or testing purposes, as it does not perform any actual fine-tuning logic.
Parameters:
-
base_model–A string specifying the base model name. Defaults to 'base'.
-
target_path–A string specifying the target path for fine-tuning outputs. Defaults to 'target'.
-
launcher–A launcher instance for executing commands. Defaults to [launchers.remote()][lazyllm.launchers.remote].
-
**kw–Additional keyword arguments that are stored for later use.
Returns:
-
–
A string representing a dummy command. The string includes the initial arguments passed during initialization.
Examples:
>>> from lazyllm.components import DummyFinetune
>>> from lazyllm import launchers
>>> # 创建一个 DummyFinetune 实例
>>> finetuner = DummyFinetune(base_model='example-base', target_path='example-target', launcher=launchers.local(), custom_arg='custom_value')
>>> # 调用 cmd 方法生成占位命令
>>> command = finetuner.cmd('--example-arg', key='value')
>>> print(command)
... echo 'dummy finetune!, and init-args is {'custom_arg': 'custom_value'}'
Source code in lazyllm/components/finetune/base.py
cmd(*args, **kw)
The cmd method generates a dummy command string for fine-tuning. This method is primarily for testing or demonstration purposes.
Parameters:
-
*args–Positional arguments to be included in the command (not used in this implementation).
-
**kw–Keyword arguments to be included in the command (not used in this implementation).
Returns:
-
str–A string representing a dummy command. The string includes the initial arguments (
**kw) passed during the instance initialization, which are stored inself.kw.
Example
If the class is initialized with custom_arg='value', calling the cmd method will return:
"echo 'dummy finetune!, and init-args is {'custom_arg': 'value'}'"
Examples:
>>> from lazyllm.components import DummyFinetune
>>> from lazyllm import launchers
>>> # 创建一个 DummyFinetune 实例,并传递初始化参数
>>> finetuner = DummyFinetune(base_model='example-base', target_path='example-target', launcher=launchers.local(), custom_arg='value')
>>> # 调用 cmd 方法生成占位命令
>>> command = finetuner.cmd()
>>> # 打印生成的占位命令
>>> print(command)
... echo 'dummy finetune!, and init-args is {'custom_arg': 'value'}'
Source code in lazyllm/components/finetune/base.py
lazyllm.components.finetune.LazyLLMFinetuneBase
Bases: ComponentBase
LazyLLM fine-tuning base component class, inherits from ComponentBase.
Provides base functionality for large language model fine-tuning, supports remote launcher configuration and model path management.
Parameters:
-
base_model(str) –Base model path or identifier
-
target_path(str) –Fine-tuned model output path
-
launcher(Launcher, default:remote()) –Task launcher, defaults to remote launcher
Source code in lazyllm/components/finetune/base.py
Deploy
lazyllm.components.deploy.Lightllm
Bases: LazyLLMDeployBase
This class is a subclass of LazyLLMDeployBase, based on the inference capabilities provided by the LightLLM framework, used for inference with large language models.
Parameters:
-
trust_remote_code(bool, default:True) –Whether to trust remote code, defaults to True
-
launcher(Launcher, default:remote(ngpus=1)) –Task launcher, defaults to single GPU remote launcher
-
log_path(str, default:None) –Log file path, defaults to None
-
**kw–Other LightLLM server configuration parameters
The keyword arguments and their default values for this class are as follows:
Other Parameters:
-
tp(int) –Tensor parallelism parameter, default is
1. -
max_total_token_num(int) –Maximum total token number, default is
64000. -
eos_id(int) –End-of-sentence ID, default is
2. -
port(int) –Service port number, default is
None, in which case LazyLLM will automatically generate a random port number. -
host(str) –Service IP address, default is
0.0.0.0. -
nccl_port(int) –NCCL port, default is
None, in which case LazyLLM will automatically generate a random port number. -
tokenizer_mode(str) –Tokenizer loading mode, default is
auto. -
running_max_req_size(int) –Maximum number of parallel requests for the inference engine, default is
256. -
data_type(str) –Data type for model weights, default is
float16. -
max_req_total_len(int) –Maximum total length for requests, default is
64000. -
max_req_input_len(int) –Maximum input length, default is
4096. -
long_truncation_mode(str) –Truncation mode for long texts, default is
head.
Examples:
Source code in lazyllm/components/deploy/lightllm.py
12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132 133 134 135 136 137 138 139 140 141 142 143 144 145 146 147 148 149 150 151 152 153 154 155 156 157 158 159 | |
cmd(finetuned_model=None, base_model=None)
This method generates the command to start the LightLLM service.
Parameters:
-
finetuned_model(str, default:None) –Path to the fine-tuned model.
-
base_model(str, default:None) –Path to the base model, used when finetuned_model is invalid.
Returns:
- LazyLLMCMD: A LazyLLMCMD object containing the startup command.
Source code in lazyllm/components/deploy/lightllm.py
geturl(job=None)
Get the URL address of the LightLLM service.
Parameters:
-
job(optional, default:None) –Job object, defaults to None, in which case self.job is used.
Returns:
- str: The service URL address in the format "http://{ip}:{port}/generate".
Source code in lazyllm/components/deploy/lightllm.py
extract_result(x, inputs)
staticmethod
Extract generated text from the service response.
Parameters:
-
x(str) –Response text from the service.
-
inputs(str) –Input text.
Returns:
- str: The extracted generated text.
Raises:
-
Exception–When JSON response parsing fails.
Source code in lazyllm/components/deploy/lightllm.py
lazyllm.components.deploy.Vllm
Bases: LazyLLMDeployBase
This class is a subclass of LazyLLMDeployBase, leveraging the VLLM framework to deploy and run inference on large language models.
Parameters:
-
trust_remote_code(bool, default:True) –Whether to allow loading of model code from remote sources. Default is
True. -
launcher(launcher, default:remote(ngpus=1)) –The launcher used to start the model. Default is
launchers.remote(ngpus=1). -
log_path(str, default:None) –Path to store logs. If
None, logs will not be saved. -
openai_api(bool, default:None) –Whether to start VLLM with OpenAI-compatible API. Default is
False. -
kw–Keyword arguments used to override default deployment parameters. No extra arguments beyond the supported ones are allowed.
The supported keyword arguments and their default values are as follows:
Other Parameters:
-
tensor-parallel-size(int) –Tensor parallelism size. Default is
1. -
dtype(str) –Data type for model weights and activations. Default is
auto. Options include:half,float16,bfloat16,float,float32. -
kv-cache-dtype(str) –Data type for KV cache. Default is
auto. Options include:fp8,fp8_e5m2,fp8_e4m3. -
device(str) –Backend device type supported by VLLM. Default is
auto. Options include:cuda,neuron,cpu. -
block-size(int) –Token block size. Default is
16. -
port(int | str) –Service port number. Default is
auto(random assignment). -
host(str) –Service binding IP address. Default is
0.0.0.0. -
seed(int) –Random seed. Default is
0. -
tokenizer_mode(str) –Tokenizer loading mode. Default is
auto. -
max-num-seqs(int) –Maximum number of concurrent requests supported by the inference engine. Default is
256. -
pipeline-parallel-size(int) –Pipeline parallelism size. Default is
1. -
max-num-batched-tokens(int) –Maximum number of batched tokens. Default is
64000.
Examples:
Source code in lazyllm/components/deploy/vllm.py
33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132 133 134 135 136 137 138 139 140 141 142 143 144 145 146 147 148 149 150 151 152 153 154 155 156 157 158 159 160 161 162 163 164 165 166 167 168 169 170 171 172 173 174 175 176 177 178 179 180 181 182 183 184 185 186 187 188 189 190 191 192 193 194 195 196 197 198 199 200 201 202 | |
cmd(finetuned_model=None, base_model=None, master_ip=None)
Build the command to launch the vLLM inference service.
This method validates the model path and constructs an executable command string based on current configuration. In distributed mode, it will also prepend the ray cluster start command.
Parameters:
-
finetuned_model(str, default:None) –Path to the fine-tuned model.
-
base_model(str, default:None) –Fallback base model path if finetuned_model is invalid.
-
master_ip(str, default:None) –IP address of the master node in a distributed setup.
Returns:
- LazyLLMCMD: The command object with shell instruction, return value handler, and health checker.
Source code in lazyllm/components/deploy/vllm.py
geturl(job=None)
Get the inference service URL for the vLLM deployment.
Depending on the execution mode (Display or actual deployment), this method returns the appropriate URL for accessing the model's generate endpoint.
Parameters:
-
job(Job, default:None) –Deployment job object. Defaults to the module's associated job.
Returns:
- str: The HTTP URL for inference service.
Source code in lazyllm/components/deploy/vllm.py
extract_result(x, inputs)
staticmethod
Extracts the inference result from the JSON string returned by the VLLM service.
Parameters:
-
x(str) –Raw JSON string returned from the VLLM service.
-
inputs(Any) –Input arguments (not used here, kept for interface consistency).
Returns:
- str: The generated text result from the model.
Source code in lazyllm/components/deploy/vllm.py
lazyllm.components.deploy.LMDeploy
Bases: LazyLLMDeployBase
The LMDeploy class, a subclass of LazyLLMDeployBase,
leverages LMDeploy to launch and manage large language model inference services.
Parameters:
-
launcher(Optional[launcher], default:remote(ngpus=1)) –The service launcher, defaults to
launchers.remote(ngpus=1). -
trust_remote_code(bool, default:True) –Whether to trust remote code, defaults to
True. -
log_path(Optional[str], default:None) –Path to store logs, defaults to
None. -
**kw–Keyword arguments used to update the default deployment configuration. No extra arguments beyond those listed below are allowed.
Other Parameters:
-
tp(int) –Tensor parallelism factor, defaults to
1. -
server-name(str) –The IP address on which the service listens, defaults to
0.0.0.0. -
server-port(Optional[int]) –Port number for the service. Defaults to
None; in this case, a random port between 30000–40000 will be assigned. -
max-batch-size(int) –Maximum batch size, defaults to
128. -
chat-template(Optional[str]) –Path to the chat template file. If the model is not a vision-language model and no template is specified, a default template will be used.
-
eager-mode(bool) –Whether to enable eager mode, controlled by the environment variable
LMDEPLOY_EAGER_MODE, defaults toFalse.
Examples:
>>> # Basic use:
>>> from lazyllm import deploy
>>> infer = deploy.LMDeploy()
>>>
>>> # MultiModal:
>>> import lazyllm
>>> from lazyllm import deploy, globals
>>> from lazyllm.components.formatter import encode_query_with_filepaths
>>> chat = lazyllm.TrainableModule('InternVL3_5-1B').deploy_method(deploy.LMDeploy)
>>> chat.update_server()
>>> inputs = encode_query_with_filepaths('What is it?', ['path/to/image'])
>>> res = chat(inputs)
Source code in lazyllm/components/deploy/lmdeploy.py
15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132 133 134 135 136 137 138 139 140 141 142 143 144 | |
cmd(finetuned_model=None, base_model=None)
This method generates the command to start the LMDeploy service.
Parameters:
-
finetuned_model(str, default:None) –Path to the fine-tuned model.
-
base_model(str, default:None) –Path to the base model, used when finetuned_model is invalid.
Returns:
- LazyLLMCMD: A LazyLLMCMD object containing the startup command.
Source code in lazyllm/components/deploy/lmdeploy.py
geturl(job=None)
Get the URL address of the LMDeploy service.
Parameters:
-
job(optional, default:None) –Job object, defaults to None, in which case self.job is used.
Returns:
- str: The service URL address in the format "http://{ip}:{port}/v1/chat/interactive".
Source code in lazyllm/components/deploy/lmdeploy.py
extract_result(x, inputs)
staticmethod
Parses the model inference result and extracts the text output from a JSON response string.
Parameters:
-
x(str) –JSON-formatted string returned by the model.
-
inputs(dict) –The original input data (not directly used, reserved for interface compatibility).
Returns:
- str: The text result extracted from the response.
Source code in lazyllm/components/deploy/lmdeploy.py
lazyllm.components.deploy.base.DummyDeploy
Bases: LazyLLMDeployBase, Pipeline
DummyDeploy(launcher=launchers.remote(sync=False), , stream=False, *kw)
A mock deployment class for testing purposes. It extends both LazyLLMDeployBase and flows.Pipeline,
simulating a simple pipeline-style deployable service with optional streaming support.
This class is primarily intended for internal testing and demonstration. It receives inputs in the format defined
by message_format, and returns a dummy response or a streaming response depending on the stream flag.
Parameters:
-
launcher–Deployment launcher instance, defaulting to
launchers.remote(sync=False). -
stream(bool, default:False) –Whether to simulate streaming output.
-
kw–Additional keyword arguments passed to the superclass.
Call Arguments
keys_name_handle (dict): Mapping of input keys for request formatting.
message_format (dict): Default request template including input and generation parameters.
Source code in lazyllm/components/deploy/base.py
lazyllm.components.auto.AutoDeploy
Bases: LazyLLMDeployBase
This class is a subclass of LazyLLMDeployBase that automatically selects the appropriate inference framework and parameters based on the input arguments for inference with large language models.
Specifically, based on the input base_model parameters, max_token_num, the type and number of GPUs in launcher, this class can automatically select the appropriate inference framework (such as Lightllm or Vllm) and the required parameters.
Parameters:
-
base_model(str) –The base model for fine-tuning, which is required to be the name or the path to the base model. Used to provide base model information.
-
source(config[model_source]) –Specifies the model download source. This can be configured by setting the environment variable
LAZYLLM_MODEL_SOURCE. -
trust_remote_code(bool) –Whether to allow loading of model code from remote servers, default is
True. -
launcher(launcher, default:remote()) –The launcher for fine-tuning, default is
launchers.remote(ngpus=1). -
stream(bool) –Whether the response is streaming, default is
False. -
type(str) –Type parameter, default is
None, which corresponds to thellmtype. Additionally, theembedtype is also supported. -
max_token_num(int) –The maximum token length for the input fine-tuning model, default is
1024. -
launcher(launcher, default:remote()) –The launcher for fine-tuning, default is
launchers.remote(ngpus=1). -
kw–Keyword arguments used to update default training parameters. Note that whether additional keyword arguments can be specified depends on the framework inferred by LazyLLM, so it is recommended to set them carefully.
Examples:
>>> from lazyllm import deploy
>>> deploy.auto('internlm2-chat-7b')
<lazyllm.llm.deploy type=Lightllm>
Source code in lazyllm/components/auto/autodeploy.py
get_deployer(base_model, source=None, trust_remote_code=True, launcher=None, type=None, log_path=None, **kw)
classmethod
Get corresponding deployer class based on model type.
Automatically detects model type and returns the most suitable deployer class, launcher and configuration parameters.
Parameters:
-
base_model(str) –Base model name or path.
-
source(Optional[str], default:None) –Model source.
-
trust_remote_code(bool, default:True) –Whether to trust remote code.
-
launcher(Optional[LazyLLMLaunchersBase], default:None) –Launcher instance.
-
type(Optional[str], default:None) –Model type.
-
log_path(Optional[str], default:None) –Log file path.
-
**kw–Other configuration parameters.
Returns:
- Tuple: Returns (deployer class, launcher, configuration parameters dict) triple.
Source code in lazyllm/components/auto/autodeploy.py
lazyllm.components.deploy.embed.AbstractEmbedding
Bases: ABC
Abstract embedding base class that provides unified interface and basic functionality for all embedding models. This class defines the standard interface for embedding models, including model loading, calling, and serialization capabilities.
Parameters:
-
base_embed(str) –The base path or identifier of the embedding model, used to specify which embedding model to load.
-
source(str, default:None) –Model source, default to
None. If not specified, will use the default model source from LazyLLM configuration. -
init(bool, default:False) –Whether to load the model immediately during initialization, default to
False. IfTrue, will call theload_embed()method immediately when the object is created.
Source code in lazyllm/components/deploy/embed.py
load_embed()
abstractmethod
Abstract method for loading embedding models. This method is implemented by subclasses to perform specific model loading logic.
Note: This method is currently under development.
Source code in lazyllm/components/deploy/embed.py
lazyllm.components.deploy.EmbeddingDeploy
Bases: LazyLLMDeployBase
This class is a subclass of LazyLLMDeployBase, designed for deploying text embedding services. It supports both dense and sparse embedding methods, compatible with HuggingFace models and FlagEmbedding models.
Parameters:
-
launcher(Optional[launcher], default:None) –The launcher instance, defaults to
None. -
model_type(Optional[str], default:'embed') –Model type, defaults to
'embed'. -
log_path(Optional[str], default:None) –Path for log file, defaults to
None. -
embed_type(Optional[str], default:'dense') –Embedding type, either
'dense'or'sparse', defaults to'dense'. -
trust_remote_code(bool, default:True) –Whether to trust remote code, defaults to
True. -
port(Optional[int], default:None) –Service port number, defaults to
None, in which case LazyLLM will generate a random port.
Call Arguments
finetuned_model (Optional[str]): Path or name of the fine-tuned model.
base_model (Optional[str]): Path or name of the base model, used when finetuned_model is invalid.
Examples:
>>> from lazyllm import deploy
>>> embed_service = deploy.EmbeddingDeploy(embed_type='dense')
>>> embed_service('path/to/model')
Source code in lazyllm/components/deploy/embed.py
lazyllm.components.deploy.embed.RerankDeploy
Bases: EmbeddingDeploy
This class is a subclass of EmbeddingDeploy, designed for deploying reranking services. It supports text reranking using HuggingFace models.
Parameters:
-
launcher(launcher, default:None) –The launcher instance, defaults to
None. -
model_type(str, default:'embed') –Model type, defaults to
'embed'. -
log_path(str, default:None) –Path for log file, defaults to
None. -
trust_remote_code(bool, default:True) –Whether to trust remote code, defaults to
True. -
port(int, default:None) –Service port number, defaults to
None, in which case LazyLLM will generate a random port.
Call Arguments
finetuned_model: Path or name of the fine-tuned model.
base_model: Path or name of the base model, used when finetuned_model is invalid.
Examples:
>>> from lazyllm import deploy
>>> rerank_service = deploy.embed.RerankDeploy()
>>> rerank_service('path/to/model')
>>> input_data = {
... "query": "What is machine learning?",
... "documents": [
... "Machine learning is a branch of AI.",
... "Machine learning uses data to improve.",
... "Deep learning is a subset of machine learning."
... ],
... "top_n": 2
... }
>>> result = rerank_service(input_data)
Source code in lazyllm/components/deploy/embed.py
lazyllm.components.deploy.embed.LazyHuggingFaceRerank
Bases: object
Wrapper class for HuggingFace CrossEncoder-based reranking.
Ranks candidate documents by relevance score with respect to a given query.
Supports downloading and loading a specified rerank model at initialization, with optional lazy loading for faster startup.
Parameters:
-
base_rerank(str) –Name or local path of the rerank model. Supports HuggingFace Hub identifiers or local paths.
-
source(Optional[str], default:None) –Source of the model, supports
huggingfaceandmodelscope. Defaults to global configmodel_source. -
init(bool, default:False) –Whether to load the model immediately upon instantiation. If
False, the model will be loaded lazily on first call.
Source code in lazyllm/components/deploy/embed.py
load_reranker()
Load the rerank model.
This method initializes a sentence_transformers.CrossEncoder instance using self.base_rerank
and assigns it to the class attribute self.reranker for subsequent reranking tasks.
Source code in lazyllm/components/deploy/embed.py
rebuild(base_rerank, init)
classmethod
Class method to rebuild a LazyHuggingFaceRerank instance.
Used primarily for deserialization during pickle/cloudpickle operations,
reinstantiating the object with the provided parameters.
Parameters:
-
base_rerank(str) –Model name or path.
-
init(bool) –Whether to load the model immediately upon rebuilding.
Returns:
- LazyHuggingFaceRerank: The rebuilt class instance.
Source code in lazyllm/components/deploy/embed.py
lazyllm.components.deploy.embed.HuggingFaceEmbedding
HuggingFace embedding model management class for managing and registering different embedding model implementations.
Attributes:
-
_model_id_mapping(dict) –Mapping dictionary from model IDs to implementation classes.
Parameters:
-
base_embed(str) –Path or name of the base embedding model.
-
source(Optional[str], default:None) –Model source, defaults to None.
Source code in lazyllm/components/deploy/embed.py
get_emb_cls(model_name)
classmethod
Get the embedding implementation class for a model.
Parameters:
-
model_name(str) –Model name or path.
Returns:
- type: Returns corresponding embedding model implementation class, defaults to LazyHuggingFaceDefaultEmbedding if not found.
Source code in lazyllm/components/deploy/embed.py
register(model_ids)
classmethod
Decorator for registering model IDs to specific implementation classes.
Parameters:
-
model_ids(List[str]) –List of model IDs to register.
Returns:
- Callable: Returns decorator function.
Source code in lazyllm/components/deploy/embed.py
load_embed()
Load the embedding model.
This method calls the load_embed method of the internal embedding implementation class to load the model.
lazyllm.components.deploy.embed.LazyFlagEmbedding
Bases: object
A lazily loaded wrapper for the FlagEmbedding module.
This class encapsulates loading and usage of FlagEmbedding, with support for both sparse and dense embeddings. It leverages the lazyllm.once_flag() mechanism to initialize only once on demand, and integrates with LazyLLM's model downloading utilities.
Parameters:
-
base_embed(str) –The model name or path to be used as the embedding backend.
-
sparse(bool, default:False) –Whether to enable sparse embedding output. Defaults to False.
-
source(str, default:None) –Source URL or identifier for model downloading. Defaults to global config.
-
init(bool, default:False) –Whether to initialize the model immediately upon construction. Defaults to False.
Source code in lazyllm/components/deploy/embed.py
load_embed()
Load the embedding model onto the appropriate device.
This method selects the available device (GPU or CPU) and initializes the pretrained FlagEmbedding model from the provided path or model hub.
Source code in lazyllm/components/deploy/embed.py
rebuild(base_embed, sparse, init)
classmethod
Rebuild a LazyFlagEmbedding instance.
This class method reconstructs an instance of LazyFlagEmbedding, typically used during deserialization or multiprocessing scenarios.
Parameters:
-
base_embed(str) –The path or name of the embedding model.
-
sparse(bool) –Whether to enable sparse embedding mode.
-
init(bool) –Whether to load the model immediately during instantiation.
Returns:
- LazyFlagEmbedding: A newly constructed LazyFlagEmbedding instance.
Source code in lazyllm/components/deploy/embed.py
lazyllm.components.deploy.Mindie
Bases: LazyLLMDeployBase
This class is a subclass of LazyLLMDeployBase, designed for deploying and managing the MindIE large language model inference service. It encapsulates the full workflow including configuration generation, process launching, and API interaction for the MindIE service.
Parameters:
-
trust_remote_code(bool, default:True) –Whether to trust remote code (e.g., from HuggingFace models). Default is
True. -
launcher–Instance of the task launcher. Default is
launchers.remote(). -
log_path(str, default:None) –Path to save logs. If
None, logs will not be saved. -
**kw–Other configuration parameters.
Other Parameters:
-
npuDeviceIds–List of NPU device IDs (e.g.,
[[0,1]]indicates using 2 devices) -
worldSize–Model parallelism size
-
port–Service port (set to
'auto'for auto-assignment between 30000–40000) -
maxSeqLen–Maximum sequence length
-
maxInputTokenLen–Maximum number of tokens per input
-
maxPrefillTokens–Maximum number of prefill tokens
-
config–Custom configuration file
Notes
You must set the environment variable LAZYLLM_MINDIE_HOME to point to the MindIE installation directory.
If finetuned_model is not specified or the path is invalid, it will automatically fall back to base_model.
Examples:
>>> import lazyllm
>>> from lazyllm.components.deploy import Mindie
>>> deployer = Mindie(
... port=30000,
... launcher=lazyllm.launchers.remote(),
... max_seq_len=32000,
... log_path="/path/to/logs"
... )
>>> cmd = deployer.cmd(
... finetuned_model="/path/to/finetuned_model",
... base_model="/path/to/base_model")
>>> print("Service URL:", cmd.geturl())
Source code in lazyllm/components/deploy/mindie.py
15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132 133 134 135 136 137 138 139 140 141 142 143 144 145 146 147 148 149 150 151 152 153 154 155 156 157 158 159 160 161 162 163 164 165 166 167 168 169 170 171 172 173 174 175 176 177 178 179 180 181 182 183 184 185 186 187 188 189 190 191 192 193 194 195 196 197 198 199 200 201 202 203 204 205 206 207 208 209 210 211 212 213 214 215 216 217 218 219 220 221 222 223 224 225 226 227 228 229 230 231 232 233 234 235 236 237 238 239 240 241 242 243 244 245 246 247 248 249 250 251 252 253 | |
cmd(finetuned_model=None, base_model=None, master_ip=None)
Generates the command to start the MindIE service.
Parameters:
-
finetuned_model(str, default:None) –Path to the fine-tuned model
-
base_model(str, default:None) –Path to the base model (fallback if finetuned_model is invalid)
-
master_ip(str, default:None) –Master node IP address (currently unused)
Returns:
- LazyLLMCMD: Command object for starting the service
Notes
- Automatically handles model path validation
- Updates configuration before service start
- Supports random port allocation when configured
Source code in lazyllm/components/deploy/mindie.py
extract_result(x, inputs)
staticmethod
Extracts the generated text from the API response.
Parameters:
-
x–Raw API response
-
inputs–Original inputs (unused)
Returns:
- str: The generated text
Notes
- Parses JSON response
- Returns first text entry from response
Source code in lazyllm/components/deploy/mindie.py
geturl(job=None)
Gets the service URL after deployment.
Parameters:
-
job–Job object (optional, defaults to self.job)
Returns:
- str: The generate endpoint URL
Notes
- Returns different formats based on display mode
- Includes port number from configuration
Source code in lazyllm/components/deploy/mindie.py
load_config(config_path)
Loads and parses the MindIE configuration file.
Parameters:
-
config_path(str) –Path to the JSON configuration file
Returns:
- dict: Parsed configuration dictionary
Notes
- Handles both default and custom configuration files
- Uses JSON format for configuration
- Creates backup of original config before modification
Source code in lazyllm/components/deploy/mindie.py
save_config()
Saves the current configuration to file.
Notes
- Automatically creates backup of existing config
- Writes to the standard MindIE config location
- Uses JSON format with proper indentation
- Called automatically during deployment
Source code in lazyllm/components/deploy/mindie.py
update_config()
Updates the configuration dictionary with current settings.
Notes
- Handles multiple configuration sections:
- Model deployment parameters
- Server settings
- Scheduling parameters
Source code in lazyllm/components/deploy/mindie.py
lazyllm.components.deploy.Infinity
Bases: LazyLLMDeployBase
This class is a subclass of LazyLLMDeployBase, providing high-performance text-embeddings, reranking, and CLIP capabilities based on the Infinity framework.
Parameters:
-
launcher(launcher, default:remote(ngpus=1)) –The launcher for Infinity, defaulting to
launchers.remote(ngpus=1). -
kw–Keyword arguments for updating default training parameters. Note that no additional keyword arguments can be passed here except those listed below.
The keyword arguments and their default values for this class are as follows:
Other Parameters:
-
keys_name_handle(Dict) –Key name mapping dictionary.
-
message_format(Dict) –Default message format template.
-
default_headers(Dict) –Default HTTP request headers.
-
target_name(str) –API target endpoint name.
Examples:
>>> import lazyllm
>>> from lazyllm import deploy
>>> deploy.Infinity()
<lazyllm.llm.deploy type=Infinity>
Source code in lazyllm/components/deploy/infinity.py
13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132 133 134 135 136 137 | |
extract_result(x, inputs)
staticmethod
Extract result data from Infinity API response. Parses JSON response from Infinity service and extracts embedding vectors or reranking results based on the returned object type.
Parameters:
-
x(str) –JSON string response returned by API.
-
inputs(Dict) –Original input data used to determine the return format.
Source code in lazyllm/components/deploy/infinity.py
geturl(job=None)
Get the URL address of the Infinity service. Returns the corresponding API access URL address based on deployment mode and job status.
Parameters:
-
job(Optional[Any], default:None) –Job object, if None uses the current instance's job property.
Source code in lazyllm/components/deploy/infinity.py
lazyllm.components.deploy.OCRDeploy
Bases: LazyLLMDeployBase
OCRDeploy is a subclass of LazyLLMDeployBase that provides deployment for OCR (Optical Character Recognition) models. This class is designed to deploy OCR models with additional configurations such as logging, trust for remote code, and port customization.
Attributes:
keys_name_handle: A dictionary mapping input keys to their corresponding handler keys. For example:
- "inputs": Handles general inputs.
- "ocr_files": Also mapped to "inputs".
message_format: A dictionary specifying the expected message format. For example:
- {"inputs": "/path/to/pdf"} indicates that the model expects a PDF file path as input.
default_headers: A dictionary specifying default headers for API requests. Defaults to:
- {"Content-Type": "application/json"}
Parameters:
-
launcher–A launcher instance for deploying the model. Defaults to
None. -
log_path–A string specifying the path where logs should be saved. Defaults to
None. -
trust_remote_code–A boolean indicating whether to trust remote code execution. Defaults to
True. -
port–An integer specifying the port for the deployment server. Defaults to
None.
Returns:
-
–
OCRDeploy instance, can be started by calling
Examples:
>>> from lazyllm.components import OCRDeploy
>>> from lazyllm import launchers
>>> # 创建一个 OCRDeploy 实例
>>> deployer = OCRDeploy(launcher=launchers.local(), log_path='./logs', port=8080)
>>> # 使用微调的 OCR 模型部署服务器
>>> server = deployer(finetuned_model='ocr-model')
>>> # 打印部署服务器信息
>>> print(server)
... <RelayServer instance ready to handle OCR requests>
Source code in lazyllm/components/deploy/ocr/pp_ocr.py
lazyllm.components.deploy.BertDeploy
Bases: LazyLLMDeployBase
This class is a subclass of LazyLLMDeployBase, used to deploy HuggingFace-style sequence classification models. The service exposes an HTTP /generate endpoint via RelayServer (no standalone CLI). Usage mirrors StableDiffusionDeploy: calling the instance __call__(finetuned_model, base_model) returns a launchable Relay service.
Parameters:
-
launcher(Optional[LazyLLMLaunchersBase], default:None) –Launcher instance, defaults to
None. -
log_path(Optional[str], default:None) –Log file path, defaults to
None. -
trust_remote_code(bool, default:True) –Whether to pass
trust_remote_codewhen loading tokenizer/model, defaults toTrue. -
port(Optional[int], default:None) –Listen port;
Nonelets the framework assign one. -
max_length(int, default:512) –Maximum sequence length for tokenization, default
512. -
device(Optional[str], default:None) –Device for inference, e.g.
"cuda"or"cpu";Noneselects automatically. -
**kw–Unknown keyword arguments are logged as a warning and ignored.
Returns:
- JSON string from the worker with fields such as logits, probs, and predicted_label.
Examples:
>>> import lazyllm
>>> m = lazyllm.TrainableModule('xlm-roberta-base', use_model_map=False).deploy_method(
... lazyllm.deploy.BertDeploy, port=36001, max_length=128,
... ).start()
>>> # Two-sequence input: candidate span + surrounding context.
>>> pair_out = m('candidate_span', text_b='This is the surrounding context.')
>>> print(pair_out)
>>> # Single-sequence input: either omit ``text_b`` or pass an empty string.
>>> single_out = m('This is a single sequence.')
>>> print(single_out)
Source code in lazyllm/components/deploy/bert.py
108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132 133 134 135 136 137 138 139 140 141 142 143 144 145 146 147 148 149 150 151 152 153 154 155 156 157 158 159 160 161 162 163 164 165 166 167 168 169 170 171 172 173 174 175 176 177 178 179 180 181 182 183 184 185 186 187 188 189 190 191 192 193 194 195 196 197 198 199 200 201 202 203 204 205 206 207 208 209 210 211 212 213 214 215 216 217 | |
lazyllm.components.deploy.relay.base.RelayServer
Bases: LazyLLMDeployBase
Source code in lazyllm/components/deploy/relay/base.py
cmd(func=None)
Source code in lazyllm/components/deploy/relay/base.py
lazyllm.components.deploy.text_to_speech.utils.TTSBase
Bases: LazyLLMDeployBase
Base class for TTS (Text-to-Speech) services.
Provides the deployment framework for text-to-speech services, supporting model loading and RelayServer deployment.
Parameters:
-
launcher(LazyLLMLaunchersBase, default:None) –Task launcher
-
log_path(str, default:None) –Log file path
-
port(int, default:None) –Service port number
Source code in lazyllm/components/deploy/text_to_speech/utils.py
Prompter
lazyllm.components.prompter.LazyLLMPrompterBase
LazyLLM prompter base class for managing and generating model prompts.
Parameters:
-
show(bool, default:False) –Whether to display generated prompts, defaults to False.
-
tools(Optional[List], default:None) –List of available tools, defaults to None.
-
history(Optional[List], default:None) –Conversation history, defaults to None.
Attributes:
-
ISA(str) –Instruction separator start token "<!lazyllm-spliter!>".
-
ISE(str) –Instruction separator end token "</!lazyllm-spliter!>".
Configuration Items
system: System role setting
sos/eos: Session start/end markers
soh/eoh: Human input start/end markers
soa/eoa: AI response start/end markers
soe/eoe: Tool execution result start/end markers
tool_start_token/tool_end_token: Tool call start/end markers
tool_args_token: Tool arguments marker
Source code in lazyllm/components/prompter/builtinPrompt.py
37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132 133 134 135 136 137 138 139 140 141 142 143 144 145 146 147 148 149 150 151 152 153 154 155 156 157 158 159 160 161 162 163 164 165 166 167 168 169 170 171 172 173 174 175 176 177 178 179 180 181 182 183 184 185 186 187 188 189 190 191 192 193 194 195 196 197 198 199 200 201 202 203 204 205 206 207 208 209 210 211 212 213 214 215 216 217 218 219 220 221 222 223 224 225 226 227 228 229 230 231 232 233 234 235 236 237 238 239 240 241 242 243 244 245 246 247 248 249 250 251 252 253 254 255 256 257 258 259 260 261 262 263 264 265 266 267 268 269 270 271 272 273 274 275 276 277 278 279 280 281 282 283 284 285 286 287 288 289 290 291 292 293 294 295 296 297 298 299 300 301 302 303 304 305 306 307 308 309 310 311 312 313 314 315 316 317 318 319 320 321 322 323 324 325 326 327 328 329 330 331 332 333 | |
generate_prompt(input=None, history=None, tools=None, label=None, *, show=False, return_dict=False, format=None)
Generate a corresponding Prompt based on user input.
Parameters:
-
input(Option[str | Dict], default:None) –The input from the prompter, if it's a dict, it will be filled into the slots of the instruction; if it's a str, it will be used as input.
-
history(Option[List[List | Dict]], default:None) –Historical conversation, can be
[[u, s], [u, s]]or in openai's history format, defaults to None. -
tools(Option[List[Dict]], default:None) –A collection of tools that can be used, used when the large model performs FunctionCall, defaults to None.
-
label(Option[str], default:None) –Label, used during fine-tuning or training, defaults to None.
-
show(bool, default:False) –Flag indicating whether to print the generated Prompt, defaults to False.
-
return_dict(bool, default:False) –Deprecated; prefer
format="openai". WhenformatisNoneand this is True, behaves likeformat="openai"and emits a one-time deprecation warning. Defaults to False. -
format(Option[str], default:None) –Output structure.
Nonereturns a concatenated string for local/finetuning use;"openai"returns OpenAI-stylemessages(and optionaltools);"anthropic"returns Anthropic-stylesystem/messages.OnlineChatModulepasses the appropriate value. If bothreturn_dictandformatare provided,formattakes precedence. Defaults toNone.
Source code in lazyllm/components/prompter/builtinPrompt.py
get_response(output, input=None)
Used to truncate the Prompt, keeping only valuable output.
Parameters:
-
output(str) –The output of the large model.
-
input(Option[str], default:None) –The input of the large model. If this parameter is specified, any part of the output that includes the input will be completely truncated. Defaults to None.
Source code in lazyllm/components/prompter/builtinPrompt.py
pre_hook(func=None)
Sets a pre-processing hook function, allowing external custom processing of input data before prompt generation.
Parameters:
-
func(Optional[Callable], default:None) –A callable object to be used as the pre-processing hook function, which receives and processes input data.
Returns:
- LazyLLMPrompterBase: Returns the instance itself to support method chaining.
Source code in lazyllm/components/prompter/builtinPrompt.py
lazyllm.components.prompter.EmptyPrompter
Bases: LazyLLMPrompterBase
An empty prompt generator that inherits from LazyLLMPrompterBase, and directly returns the original input.
This class performs no formatting and is useful for debugging, testing, or as a placeholder.
Examples:
>>> from lazyllm.components.prompter import EmptyPrompter
>>> prompter = EmptyPrompter()
>>> prompter.generate_prompt("Hello LazyLLM")
'Hello LazyLLM'
>>> prompter.generate_prompt({"query": "Tell me a joke"})
{'query': 'Tell me a joke'}
>>> # Even with additional parameters, the input is returned unchanged
>>> prompter.generate_prompt("No-op", history=[["Hi", "Hello"]], tools=[{"name": "search"}], label="debug")
'No-op'
Source code in lazyllm/components/prompter/builtinPrompt.py
generate_prompt(input, history=None, tools=None, label=None, *, show=False, return_dict=False, format=None)
A prompt passthrough implementation that inherits from LazyLLMPrompterBase.
This method directly returns the input without any formatting. Useful for debugging, testing, or placeholder use.
Parameters:
-
input(Any) –The input to be returned directly as the prompt.
-
history(Option[List[List | Dict]], default:None) –Dialogue history, ignored. Defaults to None.
-
tools(Option[List[Dict]], default:None) –Tool definitions, ignored. Defaults to None.
-
label(Option[str], default:None) –Label, ignored. Defaults to None.
-
show(bool, default:False) –Whether to print the returned prompt. Defaults to False.
-
return_dict(bool, default:False) –Deprecated; prefer
format="openai". WhenformatisNoneand this is True, behaves likeformat="openai"with a one-time deprecation warning. -
format(Option[str], default:None) –If set (e.g.
"openai"), returns{"messages": [{"role": "user", "content": input}]}; otherwise returnsinputunchanged. If bothreturn_dictandformatare provided,formattakes precedence.
Source code in lazyllm/components/prompter/builtinPrompt.py
lazyllm.components.Prompter
Bases: object
Prompt generator class for LLM input formatting. Supports template-based prompting, history injection, and response extraction.
This class allows prompts to be defined via string templates, loaded from dicts, files, or predefined names. It supports history-aware formatting for multi-turn conversations and adapts to both mapping and string input types.
Parameters:
-
prompt(Optional[str], default:None) –Prompt template string with format placeholders.
-
response_split(Optional[str], default:None) –Optional delimiter to split model response and extract useful output.
-
chat_prompt(Optional[str], default:None) –Chat template string, must contain a history placeholder.
-
history_symbol(str, default:'llm_chat_history') –Name of the placeholder for historical messages, default is 'llm_chat_history'.
-
eoa(Optional[str], default:None) –Delimiter between assistant/user in history items.
-
eoh(Optional[str], default:None) –Delimiter between user-assistant pairs.
-
show(bool, default:False) –Whether to print the final prompt when generating. Default is False.
Examples:
>>> p = Prompter(prompt="Answer the following: {question}")
>>> p.generate_prompt("What is AI?")
'Answer the following: What is AI?'
>>> p.generate_prompt({"question": "Define machine learning"})
'Answer the following: Define machine learning'
>>> p = Prompter(
... prompt="Instruction: {instruction}",
... chat_prompt="Instruction: {instruction}\nHistory:\n{llm_chat_history}",
... history_symbol="llm_chat_history",
... eoa="</s>",
... eoh="|"
... )
>>> p.generate_prompt(
... input={"instruction": "Translate this."},
... history=[["hello", "你好"], ["how are you", "你好吗"]]
... )
'Instruction: Translate this.\nHistory:\nhello|你好</s>how are you|你好吗'
>>> prompt_conf = {
... "prompt": "Task: {task}",
... "response_split": "---"
... }
>>> p = Prompter.from_dict(prompt_conf)
>>> p.generate_prompt("Summarize this article.")
'Task: Summarize this article.'
>>> full_output = "Task: Summarize this article.---This is the summary."
>>> p.get_response(full_output)
'This is the summary.'
Source code in lazyllm/components/prompter/prompter.py
19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132 133 134 135 136 137 138 139 140 141 142 143 144 145 146 147 148 149 150 151 152 153 154 155 156 157 158 159 160 161 162 163 164 165 166 167 168 169 170 171 172 173 174 175 176 177 178 179 180 181 182 183 184 185 186 187 188 189 190 | |
from_dict(prompt, *, show=False)
classmethod
Initializes a Prompter instance from a prompt configuration dictionary.
Parameters:
-
prompt(Dict) –A dictionary containing prompt-related configuration. Must include 'prompt' key.
-
show(bool, default:False) –Whether to display the generated prompt. Defaults to False.
Returns:
- Prompter: An initialized Prompter instance.
Source code in lazyllm/components/prompter/prompter.py
from_template(template_name, *, show=False)
classmethod
Loads prompt configuration from a template name and initializes a Prompter instance.
Parameters:
-
template_name(str) –Name of the template. Must exist in the
templatesdictionary. -
show(bool, default:False) –Whether to display the generated prompt. Defaults to False.
Returns:
- Prompter: An initialized Prompter instance.
Source code in lazyllm/components/prompter/prompter.py
from_file(fname, *, show=False)
classmethod
Loads prompt configuration from a JSON file and initializes a Prompter instance.
Parameters:
-
fname(str) –Path to the JSON configuration file.
-
show(bool, default:False) –Whether to display the generated prompt. Defaults to False.
Returns:
-
Prompter–An initialized Prompter instance.
Source code in lazyllm/components/prompter/prompter.py
empty()
classmethod
Creates an empty Prompter instance.
Returns:
-
Prompter–A Prompter instance without any prompt configuration.
generate_prompt(input, history=None, tools=None, label=None, show=False)
Generates a formatted prompt string based on input and optional conversation history.
Parameters:
-
input(Union[str, Dict]) –User input. Can be a single string or a dictionary with multiple fields.
-
history(Optional[List[List[str]]], default:None) –Multi-turn dialogue history, e.g., [['u1', 'a1'], ['u2', 'a2']].
-
tools(Optional[Any], default:None) –Not supported. Must be None.
-
label(Optional[str], default:None) –Optional label to append to the prompt, commonly used for training.
-
show(bool, default:False) –Whether to print the generated prompt. Defaults to False.
Returns:
-
str–The final formatted prompt string.
Source code in lazyllm/components/prompter/prompter.py
get_response(response, input=None)
Extracts the actual model answer from the full response returned by an LLM.
Parameters:
-
response(str) –The full raw output from the model.
-
input(Optional[str], default:None) –If the response starts with the input, that part will be removed.
Returns:
-
str–The cleaned model response.
Source code in lazyllm/components/prompter/prompter.py
options: heading_level: 3 inherited_members: - generate_prompt - get_response members: false
lazyllm.components.prompter.EmptyPrompter
Bases: LazyLLMPrompterBase
An empty prompt generator that inherits from LazyLLMPrompterBase, and directly returns the original input.
This class performs no formatting and is useful for debugging, testing, or as a placeholder.
Examples:
>>> from lazyllm.components.prompter import EmptyPrompter
>>> prompter = EmptyPrompter()
>>> prompter.generate_prompt("Hello LazyLLM")
'Hello LazyLLM'
>>> prompter.generate_prompt({"query": "Tell me a joke"})
{'query': 'Tell me a joke'}
>>> # Even with additional parameters, the input is returned unchanged
>>> prompter.generate_prompt("No-op", history=[["Hi", "Hello"]], tools=[{"name": "search"}], label="debug")
'No-op'
Source code in lazyllm/components/prompter/builtinPrompt.py
generate_prompt(input, history=None, tools=None, label=None, *, show=False, return_dict=False, format=None)
A prompt passthrough implementation that inherits from LazyLLMPrompterBase.
This method directly returns the input without any formatting. Useful for debugging, testing, or placeholder use.
Parameters:
-
input(Any) –The input to be returned directly as the prompt.
-
history(Option[List[List | Dict]], default:None) –Dialogue history, ignored. Defaults to None.
-
tools(Option[List[Dict]], default:None) –Tool definitions, ignored. Defaults to None.
-
label(Option[str], default:None) –Label, ignored. Defaults to None.
-
show(bool, default:False) –Whether to print the returned prompt. Defaults to False.
-
return_dict(bool, default:False) –Deprecated; prefer
format="openai". WhenformatisNoneand this is True, behaves likeformat="openai"with a one-time deprecation warning. -
format(Option[str], default:None) –If set (e.g.
"openai"), returns{"messages": [{"role": "user", "content": input}]}; otherwise returnsinputunchanged. If bothreturn_dictandformatare provided,formattakes precedence.
Source code in lazyllm/components/prompter/builtinPrompt.py
lazyllm.components.Prompter
Bases: object
Prompt generator class for LLM input formatting. Supports template-based prompting, history injection, and response extraction.
This class allows prompts to be defined via string templates, loaded from dicts, files, or predefined names. It supports history-aware formatting for multi-turn conversations and adapts to both mapping and string input types.
Parameters:
-
prompt(Optional[str], default:None) –Prompt template string with format placeholders.
-
response_split(Optional[str], default:None) –Optional delimiter to split model response and extract useful output.
-
chat_prompt(Optional[str], default:None) –Chat template string, must contain a history placeholder.
-
history_symbol(str, default:'llm_chat_history') –Name of the placeholder for historical messages, default is 'llm_chat_history'.
-
eoa(Optional[str], default:None) –Delimiter between assistant/user in history items.
-
eoh(Optional[str], default:None) –Delimiter between user-assistant pairs.
-
show(bool, default:False) –Whether to print the final prompt when generating. Default is False.
Examples:
>>> p = Prompter(prompt="Answer the following: {question}")
>>> p.generate_prompt("What is AI?")
'Answer the following: What is AI?'
>>> p.generate_prompt({"question": "Define machine learning"})
'Answer the following: Define machine learning'
>>> p = Prompter(
... prompt="Instruction: {instruction}",
... chat_prompt="Instruction: {instruction}\nHistory:\n{llm_chat_history}",
... history_symbol="llm_chat_history",
... eoa="</s>",
... eoh="|"
... )
>>> p.generate_prompt(
... input={"instruction": "Translate this."},
... history=[["hello", "你好"], ["how are you", "你好吗"]]
... )
'Instruction: Translate this.\nHistory:\nhello|你好</s>how are you|你好吗'
>>> prompt_conf = {
... "prompt": "Task: {task}",
... "response_split": "---"
... }
>>> p = Prompter.from_dict(prompt_conf)
>>> p.generate_prompt("Summarize this article.")
'Task: Summarize this article.'
>>> full_output = "Task: Summarize this article.---This is the summary."
>>> p.get_response(full_output)
'This is the summary.'
Source code in lazyllm/components/prompter/prompter.py
19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132 133 134 135 136 137 138 139 140 141 142 143 144 145 146 147 148 149 150 151 152 153 154 155 156 157 158 159 160 161 162 163 164 165 166 167 168 169 170 171 172 173 174 175 176 177 178 179 180 181 182 183 184 185 186 187 188 189 190 | |
from_dict(prompt, *, show=False)
classmethod
Initializes a Prompter instance from a prompt configuration dictionary.
Parameters:
-
prompt(Dict) –A dictionary containing prompt-related configuration. Must include 'prompt' key.
-
show(bool, default:False) –Whether to display the generated prompt. Defaults to False.
Returns:
- Prompter: An initialized Prompter instance.
Source code in lazyllm/components/prompter/prompter.py
from_template(template_name, *, show=False)
classmethod
Loads prompt configuration from a template name and initializes a Prompter instance.
Parameters:
-
template_name(str) –Name of the template. Must exist in the
templatesdictionary. -
show(bool, default:False) –Whether to display the generated prompt. Defaults to False.
Returns:
- Prompter: An initialized Prompter instance.
Source code in lazyllm/components/prompter/prompter.py
from_file(fname, *, show=False)
classmethod
Loads prompt configuration from a JSON file and initializes a Prompter instance.
Parameters:
-
fname(str) –Path to the JSON configuration file.
-
show(bool, default:False) –Whether to display the generated prompt. Defaults to False.
Returns:
-
Prompter–An initialized Prompter instance.
Source code in lazyllm/components/prompter/prompter.py
empty()
classmethod
Creates an empty Prompter instance.
Returns:
-
Prompter–A Prompter instance without any prompt configuration.
generate_prompt(input, history=None, tools=None, label=None, show=False)
Generates a formatted prompt string based on input and optional conversation history.
Parameters:
-
input(Union[str, Dict]) –User input. Can be a single string or a dictionary with multiple fields.
-
history(Optional[List[List[str]]], default:None) –Multi-turn dialogue history, e.g., [['u1', 'a1'], ['u2', 'a2']].
-
tools(Optional[Any], default:None) –Not supported. Must be None.
-
label(Optional[str], default:None) –Optional label to append to the prompt, commonly used for training.
-
show(bool, default:False) –Whether to print the generated prompt. Defaults to False.
Returns:
-
str–The final formatted prompt string.
Source code in lazyllm/components/prompter/prompter.py
get_response(response, input=None)
Extracts the actual model answer from the full response returned by an LLM.
Parameters:
-
response(str) –The full raw output from the model.
-
input(Optional[str], default:None) –If the response starts with the input, that part will be removed.
Returns:
-
str–The cleaned model response.
Source code in lazyllm/components/prompter/prompter.py
lazyllm.components.AlpacaPrompter
Bases: LazyLLMPrompterBase
Alpaca-style Prompter, supports tool calls, does not support historical dialogue.
Parameters:
-
instruction(Option[str], default:None) –Task instructions for the large model, with at least one fillable slot (e.g.
{instruction}). Or use a dictionary to specify thesystemanduserinstructions. -
extra_keys(Option[List], default:None) –Additional fields that will be filled with user input.
-
show(bool, default:False) –Flag indicating whether to print the generated Prompt, default is False.
-
tools(Option[list], default:None) –Tool-set which is provived for LLMs, default is None.
Examples:
>>> from lazyllm import AlpacaPrompter
>>> p = AlpacaPrompter('hello world {instruction}')
>>> p.generate_prompt('this is my input')
'You are an AI-Agent developed by LazyLLM.\nBelow is an instruction that describes a task, paired with extra messages such as input that provides further context if possible. Write a response that appropriately completes the request.\n\n ### Instruction:\nhello world this is my input\n\n\n### Response:\n'
>>> p.generate_prompt('this is my input', format='openai')
{'messages': [{'role': 'system', 'content': 'You are an AI-Agent developed by LazyLLM.\nBelow is an instruction that describes a task, paired with extra messages such as input that provides further context if possible. Write a response that appropriately completes the request.\n\n ### Instruction:\nhello world this is my input\n\n'}, {'role': 'user', 'content': ''}]}
>>>
>>> p = AlpacaPrompter('hello world {instruction}, {input}', extra_keys=['knowledge'])
>>> p.generate_prompt(dict(instruction='hello world', input='my input', knowledge='lazyllm'))
'You are an AI-Agent developed by LazyLLM.\nBelow is an instruction that describes a task, paired with extra messages such as input that provides further context if possible. Write a response that appropriately completes the request.\n\n ### Instruction:\nhello world hello world, my input\n\nHere are some extra messages you can referred to:\n\n### knowledge:\nlazyllm\n\n\n### Response:\n'
>>> p.generate_prompt(dict(instruction='hello world', input='my input', knowledge='lazyllm'), format='openai')
{'messages': [{'role': 'system', 'content': 'You are an AI-Agent developed by LazyLLM.\nBelow is an instruction that describes a task, paired with extra messages such as input that provides further context if possible. Write a response that appropriately completes the request.\n\n ### Instruction:\nhello world hello world, my input\n\nHere are some extra messages you can referred to:\n\n### knowledge:\nlazyllm\n\n'}, {'role': 'user', 'content': ''}]}
>>>
>>> p = AlpacaPrompter(dict(system="hello world", user="this is user instruction {input}"))
>>> p.generate_prompt(dict(input="my input"))
'You are an AI-Agent developed by LazyLLM.\nBelow is an instruction that describes a task, paired with extra messages such as input that provides further context if possible. Write a response that appropriately completes the request.\n\n ### Instruction:\nhello word\n\n\n\nthis is user instruction my input### Response:\n'
>>> p.generate_prompt(dict(input="my input"), format='openai')
{'messages': [{'role': 'system', 'content': 'You are an AI-Agent developed by LazyLLM.\nBelow is an instruction that describes a task, paired with extra messages such as input that provides further context if possible. Write a response that appropriately completes the request.\n\n ### Instruction:\nhello world'}, {'role': 'user', 'content': 'this is user instruction my input'}]}
Source code in lazyllm/components/prompter/alpacaPrompter.py
generate_prompt(input=None, history=None, tools=None, label=None, *, show=False, return_dict=False, format=None)
Generate a corresponding Prompt based on user input.
Parameters:
-
input(Option[str | Dict], default:None) –The input from the prompter, if it's a dict, it will be filled into the slots of the instruction; if it's a str, it will be used as input.
-
history(Option[List[List | Dict]], default:None) –Historical conversation, can be
[[u, s], [u, s]]or in openai's history format, defaults to None. -
tools(Option[List[Dict]], default:None) –A collection of tools that can be used, used when the large model performs FunctionCall, defaults to None.
-
label(Option[str], default:None) –Label, used during fine-tuning or training, defaults to None.
-
show(bool, default:False) –Flag indicating whether to print the generated Prompt, defaults to False.
-
return_dict(bool, default:False) –Deprecated; prefer
format="openai". WhenformatisNoneand this is True, behaves likeformat="openai"and emits a one-time deprecation warning. Defaults to False. -
format(Option[str], default:None) –Output structure.
Nonereturns a concatenated string for local/finetuning use;"openai"returns OpenAI-stylemessages(and optionaltools);"anthropic"returns Anthropic-stylesystem/messages.OnlineChatModulepasses the appropriate value. If bothreturn_dictandformatare provided,formattakes precedence. Defaults toNone.
Source code in lazyllm/components/prompter/builtinPrompt.py
get_response(output, input=None)
Used to truncate the Prompt, keeping only valuable output.
Parameters:
-
output(str) –The output of the large model.
-
input(Option[str], default:None) –The input of the large model. If this parameter is specified, any part of the output that includes the input will be completely truncated. Defaults to None.
Source code in lazyllm/components/prompter/builtinPrompt.py
lazyllm.components.ChatPrompter
Bases: LazyLLMPrompterBase
Prompt constructor for multi-turn dialogue, inherits from LazyLLMPrompterBase.
Supports tool calling, conversation history, and customizable instruction templates. Accepts instructions as either plain string or dict with separate system and user components, automatically merging them into a unified prompt template. Also supports injecting extra user-defined fields.
Parameters:
-
instruction(Option[str | Dict[str, str]], default:None) –The prompt instruction template. Can be a string or a dict with
systemanduserkeys. If a dict is given, the components will be merged using special delimiters. -
extra_keys(Option[List[str]], default:None) –A list of additional keys that will be filled by user input to enrich the prompt context.
-
show(bool, default:False) –Whether to print the generated prompt. Default is False.
-
tools(Option[List], default:None) –A list of tools available to the model for function-calling tasks. Default is None.
-
history(Option[List[List[str]]], default:None) –Dialogue history in the format [[user, assistant], ...]. Used to provide conversational memory. Default is None.
Examples:
- Simple instruction string
>>> p = ChatPrompter('hello world')
>>> p.generate_prompt('this is my input')
'You are an AI-Agent developed by LazyLLM.hello world\nthis is my input\n'
>>> p.generate_prompt('this is my input', format='openai')
{'messages': [{'role': 'system', 'content': 'You are an AI-Agent developed by LazyLLM.\nhello world'}, {'role': 'user', 'content': 'this is my input'}]}
- Using extra_keys
>>> p = ChatPrompter('hello world {instruction}', extra_keys=['knowledge'])
>>> p.generate_prompt({
... 'instruction': 'this is my ins',
... 'input': 'this is my inp',
... 'knowledge': 'LazyLLM-Knowledge'
... })
'You are an AI-Agent developed by LazyLLM.hello world this is my ins\nHere are some extra messages you can referred to:\n\n### knowledge:\nLazyLLM-Knowledge\nthis is my inp\n'
- With conversation history
>>> p.generate_prompt({
... 'instruction': 'this is my ins',
... 'input': 'this is my inp',
... 'knowledge': 'LazyLLM-Knowledge'
... }, history=[['s1', 'e1'], ['s2', 'e2']])
'You are an AI-Agent developed by LazyLLM.hello world this is my ins\nHere are some extra messages you can referred to:\n\n### knowledge:\nLazyLLM-Knowledge\ns1|e1\ns2|e2\nthis is my inp\n'
- Using dict format for system/user instructions
>>> p = ChatPrompter(dict(system="hello world", user="this is user instruction {input}"))
>>> p.generate_prompt({'input': "my input", 'query': "this is user query"})
'You are an AI-Agent developed by LazyLLM.hello world\nthis is user instruction my input this is user query\n'
>>> p.generate_prompt({'input': "my input", 'query': "this is user query"}, format='openai')
{'messages': [{'role': 'system', 'content': 'You are an AI-Agent developed by LazyLLM.\nhello world'}, {'role': 'user', 'content': 'this is user instruction my input this is user query'}]}
Source code in lazyllm/components/prompter/chatPrompter.py
generate_prompt(input=None, history=None, tools=None, label=None, *, show=False, return_dict=False, format=None)
Generate a corresponding Prompt based on user input.
Parameters:
-
input(Option[str | Dict], default:None) –The input from the prompter, if it's a dict, it will be filled into the slots of the instruction; if it's a str, it will be used as input.
-
history(Option[List[List | Dict]], default:None) –Historical conversation, can be
[[u, s], [u, s]]or in openai's history format, defaults to None. -
tools(Option[List[Dict]], default:None) –A collection of tools that can be used, used when the large model performs FunctionCall, defaults to None.
-
label(Option[str], default:None) –Label, used during fine-tuning or training, defaults to None.
-
show(bool, default:False) –Flag indicating whether to print the generated Prompt, defaults to False.
-
return_dict(bool, default:False) –Deprecated; prefer
format="openai". WhenformatisNoneand this is True, behaves likeformat="openai"and emits a one-time deprecation warning. Defaults to False. -
format(Option[str], default:None) –Output structure.
Nonereturns a concatenated string for local/finetuning use;"openai"returns OpenAI-stylemessages(and optionaltools);"anthropic"returns Anthropic-stylesystem/messages.OnlineChatModulepasses the appropriate value. If bothreturn_dictandformatare provided,formattakes precedence. Defaults toNone.
Source code in lazyllm/components/prompter/builtinPrompt.py
get_response(output, input=None)
Used to truncate the Prompt, keeping only valuable output.
Parameters:
-
output(str) –The output of the large model.
-
input(Option[str], default:None) –The input of the large model. If this parameter is specified, any part of the output that includes the input will be completely truncated. Defaults to None.
Source code in lazyllm/components/prompter/builtinPrompt.py
MultiModal
Text to Image
lazyllm.components.StableDiffusionDeploy
Bases: LazyLLMDeployBase
Stable Diffusion Model Deployment Class. This class is used to deploy the stable diffusion model to a specified server for network invocation.
Parameters:
-
launcher(Optional[LazyLLMLaunchersBase], default:None) –Launcher instance. Defaults to
None -
log_path(Optional[str], default:None) –Log file path. Defaults to
None -
trust_remote_code(bool, default:True) –Whether to trust remote code. Defaults to
True -
port(Optional[int], default:None) –Service port number. Defaults to
None
Examples:
>>> from lazyllm import launchers, UrlModule
>>> from lazyllm.components import StableDiffusionDeploy
>>> deployer = StableDiffusionDeploy(launchers.remote())
>>> url = deployer(base_model='stable-diffusion-3-medium')
>>> model = UrlModule(url=url)
>>> res = model('a tiny cat.')
>>> print(res)
... <lazyllm-query>{"query": "", "files": ["path/to/sd3/image_xxx.png"]}
Source code in lazyllm/components/deploy/stable_diffusion/stable_diffusion3.py
Visual Question Answering
Reference LMDeploy, which supports the Visual Question Answering model.
Text to Sound
lazyllm.components.TTSDeploy
Source code in lazyllm/components/deploy/text_to_speech/__init__.py
lazyllm.components.ChatTTSDeploy
Bases: TTSBase
ChatTTS Model Deployment Class.
Other Parameters:
-
keys_name_handle(dict) –A key mapping dictionary used to handle parameter name conversion between internal and external API interfaces. Defaults to
{'inputs': 'inputs'}. -
message_format(dict) –The request payload structure containing three main sections:
-
inputs(str): The raw text content to be synthesized into speech. -
refinetext(dict): Text refinement and stylization parameters controlling speech expression:-
prompt(str): Voice style control tags, e.g., "[oral_2][laugh_0][break_6]" -
top_P(float): Nucleus sampling parameter for decoding strategy (default: 0.7) -
top_K(int): Top-K sampling parameter (default: 20) -
temperature(float): Sampling temperature controlling randomness (default: 0.7) -
repetition_penalty(float): Repetition penalty to avoid redundant generation (default: 1.0) -
max_new_token(int): Maximum number of tokens to generate (default: 384) -
min_new_token(int): Minimum number of tokens to generate (default: 0) -
show_tqdm(bool): Whether to display progress bar during generation (default: True) -
ensure_non_empty(bool): Ensure non-empty generation result (default: True)
-
-
infercode(dict): Inference and encoding parameters affecting audio quality:-
prompt(str): Voice speed control tags, e.g., "[speed_5]" -
spk_emb(Optional): Speaker embedding vector for specifying voice characteristics (default: None) -
temperature(float): Sampling temperature for audio generation (default: 0.3) -
repetition_penalty(float): Repetition penalty coefficient (default: 1.05) -
max_new_token(int): Maximum number of tokens for audio generation (default: 2048)
-
-
Examples:
>>> from lazyllm import launchers, UrlModule
>>> from lazyllm.components import ChatTTSDeploy
>>> deployer = ChatTTSDeploy(launchers.remote())
>>> url = deployer(base_model='ChatTTS')
>>> model = UrlModule(url=url)
>>> res = model('Hello World!')
>>> print(res)
... <lazyllm-query>{"query": "", "files": ["path/to/chattts/sound_xxx.wav"]}
Source code in lazyllm/components/deploy/text_to_speech/chattts.py
lazyllm.components.BarkDeploy
Bases: TTSBase
Bark Model Deployment Class. This class is used to deploy the Bark model to a specified server for network invocation.
__init__(self, launcher=None)
Constructor, initializes the deployment class.
Parameters:
-
launcher(launcher, default:None) –An instance of the launcher used to start the remote service.
__call__(self, finetuned_model=None, base_model=None)
Deploys the model and returns the remote service address.
Parameters:
-
finetuned_model(str) –If provided, this model will be used for deployment; if not provided or the path is invalid,
base_modelwill be used. -
base_model(str) –The default model, which will be used for deployment if
finetuned_modelis invalid. -
Return(str) –The URL address of the remote service.
Notes
- Input for infer:
str. The text corresponding to the audio to be generated. - Return of infer: The string encoded from the generated file paths, starting with the encoding flag "
", followed by the serialized dictionary. The key filesin the dictionary stores a list, with elements being the paths of the generated audio files. - Supported models: bark
Examples:
>>> from lazyllm import launchers, UrlModule
>>> from lazyllm.components import BarkDeploy
>>> deployer = BarkDeploy(launchers.remote())
>>> url = deployer(base_model='bark')
>>> model = UrlModule(url=url)
>>> res = model('Hello World!')
>>> print(res)
... <lazyllm-query>{"query": "", "files": ["path/to/bark/sound_xxx.wav"]}
Source code in lazyllm/components/deploy/text_to_speech/bark.py
lazyllm.components.MusicGenDeploy
Bases: TTSBase
MusicGen Model Deployment Class. This class is used to deploy the MusicGen model to a specified server for network invocation.
__init__(self, launcher=None)
Constructor, initializes the deployment class.
Parameters:
-
launcher(launcher, default:None) –An instance of the launcher used to start the remote service.
__call__(self, finetuned_model=None, base_model=None)
Deploys the model and returns the remote service address.
Parameters:
-
finetuned_model(str) –If provided, this model will be used for deployment; if not provided or the path is invalid,
base_modelwill be used. -
base_model(str) –The default model, which will be used for deployment if
finetuned_modelis invalid. -
Return(str) –The URL address of the remote service.
Notes
- Input for infer:
str. The text corresponding to the audio to be generated. - Return of infer: The string encoded from the generated file paths, starting with the encoding flag "
", followed by the serialized dictionary. The key filesin the dictionary stores a list, with elements being the paths of the generated audio files. - Supported models: musicgen-small
Examples:
>>> from lazyllm import launchers, UrlModule
>>> from lazyllm.components import MusicGenDeploy
>>> deployer = MusicGenDeploy(launchers.remote())
>>> url = deployer(base_model='musicgen-small')
>>> model = UrlModule(url=url)
>>> model('Symphony with flute as the main melody')
... <lazyllm-query>{"query": "", "files": ["path/to/musicgen/sound_xxx.wav"]}
Source code in lazyllm/components/deploy/text_to_speech/musicgen.py
Speech to Text
lazyllm.components.SenseVoiceDeploy
Bases: LazyLLMDeployBase
SenseVoice Model Deployment Class. This class is used to deploy the SenseVoice model to a specified server for network invocation.
__init__(self, launcher=None)
Constructor, initializes the deployment class.
Parameters:
-
launcher(Optional[LazyLLMLaunchersBase], default:None) –Launcher instance, defaults to None.
-
log_path(Optional[str], default:None) –Log file path, defaults to None.
-
trust_remote_code(bool, default:True) –Whether to trust remote code, defaults to True.
-
port(Optional[int], default:None) –Service port number, defaults to None.
Notes
- Input for infer:
str. The audio path or link. - Return of infer:
str. The recognized content. - Supported models: SenseVoiceSmall
Examples:
>>> import os
>>> import lazyllm
>>> from lazyllm import launchers, UrlModule
>>> from lazyllm.components import SenseVoiceDeploy
>>> deployer = SenseVoiceDeploy(launchers.remote())
>>> url = deployer(base_model='SenseVoiceSmall')
>>> model = UrlModule(url=url)
>>> model('path/to/audio') # support format: .mp3, .wav
... xxxxxxxxxxxxxxxx
Source code in lazyllm/components/deploy/speech_to_text/sense_voice.py
lazyllm.components.deploy.speech_to_text.sense_voice.SenseVoice
Bases: object
The SenseVoice class encapsulates FunASR-based speech-to-text model loading and invocation.
It supports lazy initialization, automatic model downloading, and accepts string paths, URLs, or dicts containing audio.
Parameters:
-
base_path(str) –Model path or identifier, downloaded locally via ModelManager.
-
source(Optional[str], default:None) –Model source, defaults to
lazyllm.config['model_source']if not specified. -
init(bool, default:False) –Whether to load the model immediately during initialization. Defaults to
False.
Attributes:
-
base_path(str) –Resolved local path of the downloaded model.
-
model(Optional[AutoModel]) –Instance of the FunASR speech recognition model, available after initialization.
-
init_flag–A flag used for lazy loading, ensuring the model is loaded only once.
Source code in lazyllm/components/deploy/speech_to_text/sense_voice.py
load_stt()
Initializes and loads the FunASR speech-to-text model. Supports Huawei NPU acceleration if torch_npu is available.
Uses fsmn-vad for voice activity detection (VAD), supporting long utterances.
Maximum single segment duration is set to 30 seconds.
Default inference device is cuda:0 (GPU).
The loaded model is assigned to self.model for subsequent audio transcription.
Note:
- If the environment has torch_npu installed, the method will import it to enable Ascend NPU acceleration.
Source code in lazyllm/components/deploy/speech_to_text/sense_voice.py
rebuild(base_path, init)
classmethod
Class method to reconstruct a SenseVoice instance during deserialization (e.g., with cloudpickle).
Parameters:
-
base_path(str) –Path to the speech-to-text model.
-
init(bool) –Whether to initialize and load the model upon instantiation.
Returns:
- SenseVoice: A new
SenseVoiceinstance, used for serialization/multiprocessing compatibility.
Source code in lazyllm/components/deploy/speech_to_text/sense_voice.py
ModelManager
lazyllm.components.ModelManager
ModelManager is a utility class in LazyLLM for managing and downloading models, supporting local search and Huggingface/Modelscope downloads.
Parameters:
-
model_source(Optional[str]) –Model download source, only
huggingfaceormodelscopesupported. Defaults to LAZYLLM_MODEL_SOURCE, andmodelscopeif unset. -
token(Optional[str], default:config['model_source_token']) –Access token for private models. Defaults to LAZYLLM_MODEL_SOURCE_TOKEN.
-
model_path(Optional[str], default:config['model_path']) –Colon-separated list of local absolute paths to search before download. Defaults to LAZYLLM_MODEL_PATH.
-
cache_dir(Optional[str], default:config['model_cache_dir']) –Directory for downloaded models. Defaults to LAZYLLM_MODEL_CACHE_DIR, or
~/.lazyllm/model.
Static Methods
get_model_type(model: str) -> str
Returns model type, e.g., llm or chat; returns llm if unrecognized.
get_model_prompt_keys(model: str) -> dict
Returns the prompt key mapping dictionary for the model.
validate_model_path(model_path: str) -> bool
Checks if directory contains valid model files (extensions: .pt, .bin, .safetensors).
Instance Methods
download(model: Optional[str] = '', call_back: Optional[Callable] = None) -> str | bool Downloads the specified model. Process: 1. Search in local directories listed in model_path; 2. If not found, search in cache_dir; 3. If still not found, download from model_source to cache_dir.
Args:
model (Optional[str]): Target model name, can be abbreviated or full name from source.
call_back (Optional[Callable]): Optional callback function for download progress.
Examples:
>>> from lazyllm.components import ModelManager
>>> downloader = ModelManager(model_source='modelscope')
>>> downloader.download('chatglm3-6b')
Source code in lazyllm/components/utils/downloader/model_downloader.py
121 122 123 124 125 126 127 128 129 130 131 132 133 134 135 136 137 138 139 140 141 142 143 144 145 146 147 148 149 150 151 152 153 154 155 156 157 158 159 160 161 162 163 164 165 166 167 168 169 170 171 172 173 174 175 176 177 178 179 180 181 182 183 184 185 186 187 188 189 190 191 192 193 194 195 196 197 198 199 200 201 202 203 204 205 206 207 208 209 210 211 212 213 214 215 216 217 218 219 220 221 222 223 224 225 226 227 228 229 230 231 232 233 234 235 236 237 238 239 240 241 242 243 244 245 246 247 248 249 250 251 252 253 254 255 256 257 258 259 260 261 262 263 264 265 266 267 268 269 270 271 272 273 274 275 276 277 278 279 280 281 282 283 284 285 286 287 288 289 290 291 292 293 294 295 296 297 298 299 300 301 302 303 304 305 306 307 308 309 310 311 312 313 314 315 316 317 318 319 320 321 322 323 324 325 326 327 328 329 330 331 332 333 334 335 336 337 338 339 340 341 342 343 344 345 346 347 348 349 350 351 352 353 354 355 356 357 358 359 360 361 | |
get_model_type(model)
cached
staticmethod
Retrieve the type of a model (e.g., LLM, VLM) based on its name.
Parameters:
-
model(str) –Model name or path, must be a non-empty string.
Returns:
- str: Model type, returns
llmif no match is found.
Source code in lazyllm/components/utils/downloader/model_downloader.py
get_model_prompt_keys(model)
cached
staticmethod
Get the prompt key mapping dictionary for the specified model, used for constructing inputs during inference.
Parameters:
-
model(str) –Model name or path.
Returns:
- dict: The prompt key mapping for the model, or an empty dictionary if none exists
Source code in lazyllm/components/utils/downloader/model_downloader.py
validate_model_path(model_path)
staticmethod
Check whether the specified path contains valid model files (.pt, .bin, .safetensors).
Parameters:
-
model_path(str) –Path to the model directory.
Returns:
- bool: True if model files exist in the directory, False otherwise
Source code in lazyllm/components/utils/downloader/model_downloader.py
download(model='', call_back=None)
Download the specified model by name. If it already exists locally, return the local path.
Supports automatic download from Huggingface and Modelscope, creating a symbolic link in the cache directory for unified management.
Parameters:
-
model(str, default:'') –Model name or path, defaults to empty string which means no download.
-
call_back(Optional[Callable], default:None) –Callback function for download progress, receives current download status.
Returns:
- str | bool: Full local path to the model, or False if the download fails
Source code in lazyllm/components/utils/downloader/model_downloader.py
Formatter
lazyllm.components.formatter.LazyLLMFormatterBase
This class is the base class of the formatter. The formatter is the formatter of the model output result. Users can customize the formatter or use the formatter provided by LazyLLM.
Examples:
>>> from lazyllm.components.formatter import LazyLLMFormatterBase
>>> class MyFormatter(LazyLLMFormatterBase):
... def __init__(self, formatter: str = None):
... self._formatter = formatter
... if self._formatter:
... self._parse_formatter()
... else:
... self._slices = None
... def _parse_formatter(self):
... slice_str = self._formatter.strip()[1:-1]
... slices = []
... parts = slice_str.split(":")
... start = int(parts[0]) if parts[0] else None
... end = int(parts[1]) if len(parts) > 1 and parts[1] else None
... step = int(parts[2]) if len(parts) > 2 and parts[2] else None
... slices.append(slice(start, end, step))
... self._slices = slices
... def _load(self, data):
... return [int(x) for x in data.strip('[]').split(',')]
... def _parse_py_data_by_formatter(self, data):
... if self._slices is not None:
... result = []
... for s in self._slices:
... if isinstance(s, slice):
... result.extend(data[s])
... else:
... result.append(data[int(s)])
... return result
... else:
... return data
...
>>> fmt = MyFormatter("[1:3]")
>>> res = fmt.format("[1,2,3,4,5]")
>>> print(res)
[2, 3]
Source code in lazyllm/components/formatter/formatterbase.py
format(msg)
Format input message.
Parameters:
-
msg–Input message, can be string or other format
Returns:
- Formatted data, specific type determined by subclass implementation
Source code in lazyllm/components/formatter/formatterbase.py
lazyllm.components.formatter.formatterbase.JsonLikeFormatter
Bases: LazyLLMFormatterBase
This class is used to extract subfields from nested structures (like dicts, lists, tuples) using a JSON-like indexing syntax.
The behavior is driven by a formatter string similar to Python-style slicing and dictionary access:
[0]fetches the first item[0][{key}]accesses thekeyfield in the first item[0,1][{a,b}]fetches theaandbfields from the first and second items[::2]does slicing with a step of 2*[0][{x}]means return a wrapped/structured result
Parameters:
-
formatter(str, default:None) –A format string that controls how to slice and extract the structure. If None, the input will be returned directly.
Examples:
>>> from lazyllm.components.formatter.formatterbase import JsonLikeFormatter
>>> formatter = JsonLikeFormatter("[{a,b}]")
Source code in lazyllm/components/formatter/formatterbase.py
128 129 130 131 132 133 134 135 136 137 138 139 140 141 142 143 144 145 146 147 148 149 150 151 152 153 154 155 156 157 158 159 160 161 162 163 164 165 166 167 168 169 170 171 172 173 174 175 176 177 178 179 180 181 182 183 184 185 186 187 188 189 190 191 192 193 194 195 196 197 198 199 200 201 202 203 204 205 206 207 208 209 210 211 212 213 214 215 216 217 218 219 | |
lazyllm.components.formatter.formatterbase.PythonFormatter
Bases: JsonLikeFormatter
Reserved formatter class for supporting Python-style data extraction syntax. To be developed.
Currently inherits from JsonLikeFormatter with no additional behavior.
Source code in lazyllm/components/formatter/formatterbase.py
lazyllm.components.formatter.FileFormatter
Bases: LazyLLMFormatterBase
A formatter that transforms query strings with document context between structured formats.
Supports three modes:
- "decode": Decodes structured query strings into dictionaries with
queryandfiles. - "encode": Encodes a dictionary with
queryandfilesinto a structured query string. - "merge": Merges multiple structured query strings into one.
Parameters:
-
formatter(str, default:'decode') –The operation mode. Must be one of "decode", "encode", or "merge". Defaults to "decode".
Examples:
Source code in lazyllm/components/formatter/formatterbase.py
lazyllm.components.formatter.YamlFormatter
Bases: JsonLikeFormatter
A formatter for extracting structured information from YAML-formatted strings.
Inherits from JsonLikeFormatter. Uses the internal method to parse YAML strings into Python objects, and then applies JSON-like formatting rules to extract desired fields.
Suitable for handling nested YAML content with formatter-based field selection.
Examples:
>>> from lazyllm.components.formatter import YamlFormatter
>>> formatter = YamlFormatter("{name,age}")
>>> msg = """
... name: Alice
... age: 30
... city: London
... """
>>> formatter(msg)
{'name': 'Alice', 'age': 30}
Source code in lazyllm/components/formatter/yamlformatter.py
lazyllm.components.formatter.encode_query_with_filepaths(query=None, files=None)
Encodes a query string together with associated file paths into a structured string format with context.
If file paths are provided, the query and file list will be wrapped into a JSON object prefixed with __lazyllm_docs__. Otherwise, it returns the original query string.
Parameters:
-
query(str, default:None) –The user query string. Defaults to an empty string.
-
files(str or List[str], default:None) –File path(s) associated with the query. Can be a single string or a list of strings.
Returns:
- str: A structured encoded query string or the raw query.
Raises:
-
AssertionError–If
filesis not a string or list of strings.
Examples:
>>> # Encode a query along with associated documentation files
>>> encode_query_with_filepaths("Generate questions based on the document", files=["a.md"])
'<lazyllm-query>{"query": "Generate questions based on the document", "files": ["a.md"]}'
Source code in lazyllm/components/formatter/formatterbase.py
lazyllm.components.formatter.decode_query_with_filepaths(query_files)
Decodes a structured query string into a dictionary containing the original query and file paths.
If the input string starts with the special prefix __lazyllm_docs__, it attempts to parse the JSON content; otherwise, it returns the raw query string as-is.
Parameters:
-
query_files(str) –The encoded query string that may include both query and file paths.
Returns:
- Union[dict, str]: A dictionary containing 'query' and 'files' if structured, otherwise the original query string.
Raises:
-
AssertionError–If the input is not a string.
-
ValueError–If the string is prefixed but JSON decoding fails.
Examples:
>>> # Decode a structured query with files
>>> decode_query_with_filepaths('<lazyllm-query>{"query": "Summarize the content", "files": ["doc.md"]}')
{'query': 'Summarize the content', 'files': ['doc.md']}
>>> # Decode a plain string without files
>>> decode_query_with_filepaths("This is just a simple question")
'This is just a simple question'
Source code in lazyllm/components/formatter/formatterbase.py
lazyllm.components.formatter.lazyllm_merge_query(*args)
Merges multiple query strings (potentially with associated file paths) into a single structured query string.
Each argument can be a plain query string or a structured query created by encode_query_with_filepaths. The function decodes each input, concatenates all query texts, and merges the associated file paths. The final result is re-encoded into a single query string with unified context.
Parameters:
-
*args(str, default:()) –Multiple query strings. Each can be either plain text or an encoded structured query with files.
Returns:
- str: A single structured query string containing the merged query and file paths.
Examples:
>>> # Merge two structured queries with English content and associated files
>>> q1 = encode_query_with_filepaths("Please summarize document one", files=["doc1.md"])
>>> q2 = encode_query_with_filepaths("Add details from document two", files=["doc2.md"])
>>> lazyllm_merge_query(q1, q2)
'<lazyllm-query>{"query": "Please summarize document oneAdd details from document two", "files": ["doc1.md", "doc2.md"]}'
>>> # Merge plain English text queries without documents
>>> lazyllm_merge_query("What is AI?", "Explain deep learning.")
'What is AI?Explain deep learning.'
Source code in lazyllm/components/formatter/formatterbase.py
lazyllm.components.formatter.formatterbase.JsonLikeFormatter
Bases: LazyLLMFormatterBase
This class is used to extract subfields from nested structures (like dicts, lists, tuples) using a JSON-like indexing syntax.
The behavior is driven by a formatter string similar to Python-style slicing and dictionary access:
[0]fetches the first item[0][{key}]accesses thekeyfield in the first item[0,1][{a,b}]fetches theaandbfields from the first and second items[::2]does slicing with a step of 2*[0][{x}]means return a wrapped/structured result
Parameters:
-
formatter(str, default:None) –A format string that controls how to slice and extract the structure. If None, the input will be returned directly.
Examples:
>>> from lazyllm.components.formatter.formatterbase import JsonLikeFormatter
>>> formatter = JsonLikeFormatter("[{a,b}]")
Source code in lazyllm/components/formatter/formatterbase.py
128 129 130 131 132 133 134 135 136 137 138 139 140 141 142 143 144 145 146 147 148 149 150 151 152 153 154 155 156 157 158 159 160 161 162 163 164 165 166 167 168 169 170 171 172 173 174 175 176 177 178 179 180 181 182 183 184 185 186 187 188 189 190 191 192 193 194 195 196 197 198 199 200 201 202 203 204 205 206 207 208 209 210 211 212 213 214 215 216 217 218 219 | |
lazyllm.components.formatter.formatterbase.PythonFormatter
Bases: JsonLikeFormatter
Reserved formatter class for supporting Python-style data extraction syntax. To be developed.
Currently inherits from JsonLikeFormatter with no additional behavior.
Source code in lazyllm/components/formatter/formatterbase.py
lazyllm.components.formatter.FileFormatter
Bases: LazyLLMFormatterBase
A formatter that transforms query strings with document context between structured formats.
Supports three modes:
- "decode": Decodes structured query strings into dictionaries with
queryandfiles. - "encode": Encodes a dictionary with
queryandfilesinto a structured query string. - "merge": Merges multiple structured query strings into one.
Parameters:
-
formatter(str, default:'decode') –The operation mode. Must be one of "decode", "encode", or "merge". Defaults to "decode".
Examples:
Source code in lazyllm/components/formatter/formatterbase.py
lazyllm.components.formatter.YamlFormatter
Bases: JsonLikeFormatter
A formatter for extracting structured information from YAML-formatted strings.
Inherits from JsonLikeFormatter. Uses the internal method to parse YAML strings into Python objects, and then applies JSON-like formatting rules to extract desired fields.
Suitable for handling nested YAML content with formatter-based field selection.
Examples:
>>> from lazyllm.components.formatter import YamlFormatter
>>> formatter = YamlFormatter("{name,age}")
>>> msg = """
... name: Alice
... age: 30
... city: London
... """
>>> formatter(msg)
{'name': 'Alice', 'age': 30}
Source code in lazyllm/components/formatter/yamlformatter.py
lazyllm.components.formatter.encode_query_with_filepaths(query=None, files=None)
Encodes a query string together with associated file paths into a structured string format with context.
If file paths are provided, the query and file list will be wrapped into a JSON object prefixed with __lazyllm_docs__. Otherwise, it returns the original query string.
Parameters:
-
query(str, default:None) –The user query string. Defaults to an empty string.
-
files(str or List[str], default:None) –File path(s) associated with the query. Can be a single string or a list of strings.
Returns:
- str: A structured encoded query string or the raw query.
Raises:
-
AssertionError–If
filesis not a string or list of strings.
Examples:
>>> # Encode a query along with associated documentation files
>>> encode_query_with_filepaths("Generate questions based on the document", files=["a.md"])
'<lazyllm-query>{"query": "Generate questions based on the document", "files": ["a.md"]}'
Source code in lazyllm/components/formatter/formatterbase.py
lazyllm.components.formatter.decode_query_with_filepaths(query_files)
Decodes a structured query string into a dictionary containing the original query and file paths.
If the input string starts with the special prefix __lazyllm_docs__, it attempts to parse the JSON content; otherwise, it returns the raw query string as-is.
Parameters:
-
query_files(str) –The encoded query string that may include both query and file paths.
Returns:
- Union[dict, str]: A dictionary containing 'query' and 'files' if structured, otherwise the original query string.
Raises:
-
AssertionError–If the input is not a string.
-
ValueError–If the string is prefixed but JSON decoding fails.
Examples:
>>> # Decode a structured query with files
>>> decode_query_with_filepaths('<lazyllm-query>{"query": "Summarize the content", "files": ["doc.md"]}')
{'query': 'Summarize the content', 'files': ['doc.md']}
>>> # Decode a plain string without files
>>> decode_query_with_filepaths("This is just a simple question")
'This is just a simple question'
Source code in lazyllm/components/formatter/formatterbase.py
lazyllm.components.formatter.lazyllm_merge_query(*args)
Merges multiple query strings (potentially with associated file paths) into a single structured query string.
Each argument can be a plain query string or a structured query created by encode_query_with_filepaths. The function decodes each input, concatenates all query texts, and merges the associated file paths. The final result is re-encoded into a single query string with unified context.
Parameters:
-
*args(str, default:()) –Multiple query strings. Each can be either plain text or an encoded structured query with files.
Returns:
- str: A single structured query string containing the merged query and file paths.
Examples:
>>> # Merge two structured queries with English content and associated files
>>> q1 = encode_query_with_filepaths("Please summarize document one", files=["doc1.md"])
>>> q2 = encode_query_with_filepaths("Add details from document two", files=["doc2.md"])
>>> lazyllm_merge_query(q1, q2)
'<lazyllm-query>{"query": "Please summarize document oneAdd details from document two", "files": ["doc1.md", "doc2.md"]}'
>>> # Merge plain English text queries without documents
>>> lazyllm_merge_query("What is AI?", "Explain deep learning.")
'What is AI?Explain deep learning.'
Source code in lazyllm/components/formatter/formatterbase.py
lazyllm.components.JsonFormatter
Bases: JsonLikeFormatter
This class is a JSON formatter, that is, the user wants the model to output content is JSON format, and can also select a field in the output content by indexing.
Examples:
>>> import lazyllm
>>> from lazyllm.components import JsonFormatter
>>> toc_prompt='''
... You are now an intelligent assistant. Your task is to understand the user's input and convert the outline into a list of nested dictionaries. Each dictionary contains a `title` and a `describe`, where the `title` should clearly indicate the level using Markdown format, and the `describe` is a description and writing guide for that section.
...
... Please generate the corresponding list of nested dictionaries based on the following user input:
...
... Example output:
... [
... {
... "title": "# Level 1 Title",
... "describe": "Please provide a detailed description of the content under this title, offering background information and core viewpoints."
... },
... {
... "title": "## Level 2 Title",
... "describe": "Please provide a detailed description of the content under this title, giving specific details and examples to support the viewpoints of the Level 1 title."
... },
... {
... "title": "### Level 3 Title",
... "describe": "Please provide a detailed description of the content under this title, deeply analyzing and providing more details and data support."
... }
... ]
... User input is as follows:
... '''
>>> query = "Please help me write an article about the application of artificial intelligence in the medical field."
>>> m = lazyllm.TrainableModule("internlm2-chat-20b").prompt(toc_prompt).start()
>>> ret = m(query, max_new_tokens=2048)
>>> print(f"ret: {ret!r}") # the model output without specifying a formatter
'Based on your user input, here is the corresponding list of nested dictionaries:
[
{
"title": "# Application of Artificial Intelligence in the Medical Field",
"describe": "Please provide a detailed description of the application of artificial intelligence in the medical field, including its benefits, challenges, and future prospects."
},
{
"title": "## AI in Medical Diagnosis",
"describe": "Please provide a detailed description of how artificial intelligence is used in medical diagnosis, including specific examples of AI-based diagnostic tools and their impact on patient outcomes."
},
{
"title": "### AI in Medical Imaging",
"describe": "Please provide a detailed description of how artificial intelligence is used in medical imaging, including the advantages of AI-based image analysis and its applications in various medical specialties."
},
{
"title": "### AI in Drug Discovery and Development",
"describe": "Please provide a detailed description of how artificial intelligence is used in drug discovery and development, including the role of AI in identifying potential drug candidates and streamlining the drug development process."
},
{
"title": "## AI in Medical Research",
"describe": "Please provide a detailed description of how artificial intelligence is used in medical research, including its applications in genomics, epidemiology, and clinical trials."
},
{
"title": "### AI in Genomics and Precision Medicine",
"describe": "Please provide a detailed description of how artificial intelligence is used in genomics and precision medicine, including the role of AI in analyzing large-scale genomic data and tailoring treatments to individual patients."
},
{
"title": "### AI in Epidemiology and Public Health",
"describe": "Please provide a detailed description of how artificial intelligence is used in epidemiology and public health, including its applications in disease surveillance, outbreak prediction, and resource allocation."
},
{
"title": "### AI in Clinical Trials",
"describe": "Please provide a detailed description of how artificial intelligence is used in clinical trials, including its role in patient recruitment, trial design, and data analysis."
},
{
"title": "## AI in Medical Practice",
"describe": "Please provide a detailed description of how artificial intelligence is used in medical practice, including its applications in patient monitoring, personalized medicine, and telemedicine."
},
{
"title": "### AI in Patient Monitoring",
"describe": "Please provide a detailed description of how artificial intelligence is used in patient monitoring, including its role in real-time monitoring of vital signs and early detection of health issues."
},
{
"title": "### AI in Personalized Medicine",
"describe": "Please provide a detailed description of how artificial intelligence is used in personalized medicine, including its role in analyzing patient data to tailor treatments and predict outcomes."
},
{
"title": "### AI in Telemedicine",
"describe": "Please provide a detailed description of how artificial intelligence is used in telemedicine, including its applications in remote consultations, virtual diagnoses, and digital health records."
},
{
"title": "## AI in Medical Ethics and Policy",
"describe": "Please provide a detailed description of the ethical and policy considerations surrounding the use of artificial intelligence in the medical field, including issues related to data privacy, bias, and accountability."
}
]'
>>> m = lazyllm.TrainableModule("internlm2-chat-20b").formatter(JsonFormatter("[:][title]")).prompt(toc_prompt).start()
>>> ret = m(query, max_new_tokens=2048)
>>> print(f"ret: {ret}") # the model output of the specified formaater
['# Application of Artificial Intelligence in the Medical Field', '## AI in Medical Diagnosis', '### AI in Medical Imaging', '### AI in Drug Discovery and Development', '## AI in Medical Research', '### AI in Genomics and Precision Medicine', '### AI in Epidemiology and Public Health', '### AI in Clinical Trials', '## AI in Medical Practice', '### AI in Patient Monitoring', '### AI in Personalized Medicine', '### AI in Telemedicine', '## AI in Medical Ethics and Policy']
Source code in lazyllm/components/formatter/jsonformatter.py
6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132 133 134 135 136 137 138 139 140 141 142 143 144 | |
lazyllm.components.EmptyFormatter
Bases: LazyLLMFormatterBase
This type is the system default formatter. When the user does not specify anything or does not want to format the model output, this type is selected. The model output will be in the same format.
Examples:
>>> import lazyllm
>>> from lazyllm.components import EmptyFormatter
>>> toc_prompt='''
... You are now an intelligent assistant. Your task is to understand the user's input and convert the outline into a list of nested dictionaries. Each dictionary contains a `title` and a `describe`, where the `title` should clearly indicate the level using Markdown format, and the `describe` is a description and writing guide for that section.
...
... Please generate the corresponding list of nested dictionaries based on the following user input:
...
... Example output:
... [
... {
... "title": "# Level 1 Title",
... "describe": "Please provide a detailed description of the content under this title, offering background information and core viewpoints."
... },
... {
... "title": "## Level 2 Title",
... "describe": "Please provide a detailed description of the content under this title, giving specific details and examples to support the viewpoints of the Level 1 title."
... },
... {
... "title": "### Level 3 Title",
... "describe": "Please provide a detailed description of the content under this title, deeply analyzing and providing more details and data support."
... }
... ]
... User input is as follows:
... '''
>>> query = "Please help me write an article about the application of artificial intelligence in the medical field."
>>> m = lazyllm.TrainableModule("internlm2-chat-20b").prompt(toc_prompt).start() # the model output without specifying a formatter
>>> ret = m(query, max_new_tokens=2048)
>>> print(f"ret: {ret!r}")
'Based on your user input, here is the corresponding list of nested dictionaries:
[
{
"title": "# Application of Artificial Intelligence in the Medical Field",
"describe": "Please provide a detailed description of the application of artificial intelligence in the medical field, including its benefits, challenges, and future prospects."
},
{
"title": "## AI in Medical Diagnosis",
"describe": "Please provide a detailed description of how artificial intelligence is used in medical diagnosis, including specific examples of AI-based diagnostic tools and their impact on patient outcomes."
},
{
"title": "### AI in Medical Imaging",
"describe": "Please provide a detailed description of how artificial intelligence is used in medical imaging, including the advantages of AI-based image analysis and its applications in various medical specialties."
},
{
"title": "### AI in Drug Discovery and Development",
"describe": "Please provide a detailed description of how artificial intelligence is used in drug discovery and development, including the role of AI in identifying potential drug candidates and streamlining the drug development process."
},
{
"title": "## AI in Medical Research",
"describe": "Please provide a detailed description of how artificial intelligence is used in medical research, including its applications in genomics, epidemiology, and clinical trials."
},
{
"title": "### AI in Genomics and Precision Medicine",
"describe": "Please provide a detailed description of how artificial intelligence is used in genomics and precision medicine, including the role of AI in analyzing large-scale genomic data and tailoring treatments to individual patients."
},
{
"title": "### AI in Epidemiology and Public Health",
"describe": "Please provide a detailed description of how artificial intelligence is used in epidemiology and public health, including its applications in disease surveillance, outbreak prediction, and resource allocation."
},
{
"title": "### AI in Clinical Trials",
"describe": "Please provide a detailed description of how artificial intelligence is used in clinical trials, including its role in patient recruitment, trial design, and data analysis."
},
{
"title": "## AI in Medical Practice",
"describe": "Please provide a detailed description of how artificial intelligence is used in medical practice, including its applications in patient monitoring, personalized medicine, and telemedicine."
},
{
"title": "### AI in Patient Monitoring",
"describe": "Please provide a detailed description of how artificial intelligence is used in patient monitoring, including its role in real-time monitoring of vital signs and early detection of health issues."
},
{
"title": "### AI in Personalized Medicine",
"describe": "Please provide a detailed description of how artificial intelligence is used in personalized medicine, including its role in analyzing patient data to tailor treatments and predict outcomes."
},
{
"title": "### AI in Telemedicine",
"describe": "Please provide a detailed description of how artificial intelligence is used in telemedicine, including its applications in remote consultations, virtual diagnoses, and digital health records."
},
{
"title": "## AI in Medical Ethics and Policy",
"describe": "Please provide a detailed description of the ethical and policy considerations surrounding the use of artificial intelligence in the medical field, including issues related to data privacy, bias, and accountability."
}
]'
>>> m = lazyllm.TrainableModule("internlm2-chat-20b").formatter(EmptyFormatter()).prompt(toc_prompt).start() # the model output of the specified formatter
>>> ret = m(query, max_new_tokens=2048)
>>> print(f"ret: {ret!r}")
'Based on your user input, here is the corresponding list of nested dictionaries:
[
{
"title": "# Application of Artificial Intelligence in the Medical Field",
"describe": "Please provide a detailed description of the application of artificial intelligence in the medical field, including its benefits, challenges, and future prospects."
},
{
"title": "## AI in Medical Diagnosis",
"describe": "Please provide a detailed description of how artificial intelligence is used in medical diagnosis, including specific examples of AI-based diagnostic tools and their impact on patient outcomes."
},
{
"title": "### AI in Medical Imaging",
"describe": "Please provide a detailed description of how artificial intelligence is used in medical imaging, including the advantages of AI-based image analysis and its applications in various medical specialties."
},
{
"title": "### AI in Drug Discovery and Development",
"describe": "Please provide a detailed description of how artificial intelligence is used in drug discovery and development, including the role of AI in identifying potential drug candidates and streamlining the drug development process."
},
{
"title": "## AI in Medical Research",
"describe": "Please provide a detailed description of how artificial intelligence is used in medical research, including its applications in genomics, epidemiology, and clinical trials."
},
{
"title": "### AI in Genomics and Precision Medicine",
"describe": "Please provide a detailed description of how artificial intelligence is used in genomics and precision medicine, including the role of AI in analyzing large-scale genomic data and tailoring treatments to individual patients."
},
{
"title": "### AI in Epidemiology and Public Health",
"describe": "Please provide a detailed description of how artificial intelligence is used in epidemiology and public health, including its applications in disease surveillance, outbreak prediction, and resource allocation."
},
{
"title": "### AI in Clinical Trials",
"describe": "Please provide a detailed description of how artificial intelligence is used in clinical trials, including its role in patient recruitment, trial design, and data analysis."
},
{
"title": "## AI in Medical Practice",
"describe": "Please provide a detailed description of how artificial intelligence is used in medical practice, including its applications in patient monitoring, personalized medicine, and telemedicine."
},
{
"title": "### AI in Patient Monitoring",
"describe": "Please provide a detailed description of how artificial intelligence is used in patient monitoring, including its role in real-time monitoring of vital signs and early detection of health issues."
},
{
"title": "### AI in Personalized Medicine",
"describe": "Please provide a detailed description of how artificial intelligence is used in personalized medicine, including its role in analyzing patient data to tailor treatments and predict outcomes."
},
{
"title": "### AI in Telemedicine",
"describe": "Please provide a detailed description of how artificial intelligence is used in telemedicine, including its applications in remote consultations, virtual diagnoses, and digital health records."
},
{
"title": "## AI in Medical Ethics and Policy",
"describe": "Please provide a detailed description of the ethical and policy considerations surrounding the use of artificial intelligence in the medical field, including issues related to data privacy, bias, and accountability."
}
]'
Source code in lazyllm/components/formatter/formatterbase.py
230 231 232 233 234 235 236 237 238 239 240 241 242 243 244 245 246 247 248 249 250 251 252 253 254 255 256 257 258 259 260 261 262 263 264 265 266 267 268 269 270 271 272 273 274 275 276 277 278 279 280 281 282 283 284 285 286 287 288 289 290 291 292 293 294 295 296 297 298 299 300 301 302 303 304 305 306 307 308 309 310 311 312 313 314 315 316 317 318 319 320 321 322 323 324 325 326 327 328 329 330 331 332 333 334 335 336 337 338 339 340 341 342 343 344 345 346 347 348 349 350 351 352 353 354 355 356 357 358 359 360 361 362 363 364 365 366 367 368 369 370 371 372 373 374 375 376 377 378 | |
lazyllm.components.FunctionCallFormatter
Bases: LazyLLMFormatterBase
Function call formatter for processing message dictionaries containing function call information.
This formatter is specifically designed for handling model outputs in function calling scenarios. It extracts only the 'role', 'content', and 'tool_calls' fields from the input dictionary, filtering out other unnecessary fields.
Primarily used in function calling-related modules such as FunctionCall.
Note
- Input must be a dictionary type, otherwise an assertion error will be raised
- Only preserves 'role', 'content', and 'tool_calls' fields if they exist in the dictionary
Examples:
>>> from lazyllm.components.formatter.formatterbase import FunctionCallFormatter
>>> formatter = FunctionCallFormatter()
>>>
>>> # 处理包含函数调用的消息
>>> msg = {
... 'role': 'assistant',
... 'content': 'I will call a function to get the weather.',
... 'tool_calls': [
... {
... 'id': 'call_123',
... 'type': 'function',
... 'function': {
... 'name': 'get_weather',
... 'arguments': '{"location": "Beijing"}'
... }
... }
... ],
... 'other_field': 'will be filtered'
... }
>>> result = formatter.format(msg)
>>> print(result)
{'role': 'assistant', 'content': 'I will call a function to get the weather.', 'tool_calls': [{'id': 'call_123', 'type': 'function', 'function': {'name': 'get_weather', 'arguments': '{"location": "Beijing"}'}}]}
>>>
>>> # 处理只包含部分字段的消息
>>> msg2 = {
... 'role': 'assistant',
... 'content': 'Hello, how can I help you?'
... }
>>> result2 = formatter.format(msg2)
>>> print(result2)
{'role': 'assistant', 'content': 'Hello, how can I help you?'}
Source code in lazyllm/components/formatter/formatterbase.py
lazyllm.components.formatter.formatterbase.PipelineFormatter
Bases: LazyLLMFormatterBase
Pipeline formatter for encapsulating data processing pipelines as formatters.
This class wraps Pipeline instances as formatters and supports combining multiple formatters through pipe operators.
Parameters:
-
formatter(Pipeline) –Pipeline instance to encapsulate
Source code in lazyllm/components/formatter/formatterbase.py
ComponentBase
lazyllm.components.core.ComponentBase
Bases: object
Base class for components, providing a unified interface and basic implementation to facilitate creation of various components.
Components execute tasks via a specified launcher and support custom task execution logic.
Parameters:
-
launcher(LazyLLMLaunchersBase or type, default:empty()) –Launcher instance or launcher class used by the component, defaults to empty launcher.
Examples:
>>> from lazyllm.components.core import ComponentBase
>>> class MyComponent(ComponentBase):
... def apply(self, x):
... return x * 2
>>> comp = MyComponent()
>>> comp.name = "ExampleComponent"
>>> print(comp.name)
ExampleComponent
>>> result = comp(10)
>>> print(result)
20
>>> print(comp.apply(5))
10
Source code in lazyllm/components/core.py
apply()
Core execution method of the component, to be implemented by subclasses.
Defines the specific business logic or task execution steps of the component.
Note:
If this method is overridden by the subclass, it will be called when the component is invoked.
Source code in lazyllm/components/core.py
cmd(*args, **kw)
Generates the execution command of the component, to be implemented by subclasses.
The returned command can be a string, tuple, or list, representing the instruction to execute the task.
Note:
If the apply method is not overridden, this command will be used to create a job for the launcher to run.
Source code in lazyllm/components/core.py
lazyllm.components.deploy.ray.Distributed
Bases: LazyLLMDeployBase
Distributed deployment class, inherits from LazyLLMDeployBase.
Provides distributed model deployment functionality based on Ray framework, supports multi-node cluster deployment.
Parameters:
-
launcher–Launcher configuration, defaults to remote launcher(ngpus=1)
-
port(int, default:None) –Service port number, defaults to random port(30000-40000)
Attributes:
-
finetuned_model–Fine-tuned model path
-
base_model–Base model path
-
master_ip–Master node IP address
Methods:
-
cmd–Generate deployment command
-
geturl–Get deployed service URL address
Source code in lazyllm/components/deploy/ray.py
cmd(finetuned_model=None, base_model=None, master_ip=None)
Generate Ray distributed deployment command.
Generate corresponding Ray startup command based on whether it is a master node, supports both head node and worker node modes.
Parameters:
-
finetuned_model–Fine-tuned model path
-
base_model–Base model path
-
master_ip–Master node IP address, if empty starts as head node
Returns:
-
LazyLLMCMD–Object containing deployment command