fedem.utils package

Submodules

fedem.utils.huggingface module

fedem.utils.huggingface.get_client_details(hf_token: str | None = None) tuple[source]

Get the client details from the Hugging Face API.

Parameters:

hf_token (str, optional) – The Hugging Face API token. Defaults to None.

Returns:

The Hugging Face API client and the user details, or None if the details cannot be retrieved.

Return type:

tuple | None

fedem.utils.huggingface.verify_user_with_org(client_details: dict, org_id: str, access_level: list = ['contributor']) dict[source]

Verify if the user is part of the organization.

Parameters:
  • client_details (dict) – The client details.

  • org_id (str) – The organization ID.

  • access_level (list, optional) – Roles accepted for the user within the organization. Defaults to ['contributor'].

Returns:

The org details if the user is part of the organization, else None.

Return type:

dict | None
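The membership check can be sketched as follows. The shape of `client_details` (an `orgs` list with `name` and `roleInOrg` fields, mirroring a Hugging Face `whoami` response) is an assumption, not confirmed by this documentation:

```python
def verify_user_with_org(client_details, org_id, access_level=("contributor",)):
    """Return the org entry if the user belongs to org_id with an accepted role, else None."""
    for org in client_details.get("orgs", []):  # assumed field name
        if org.get("name") == org_id and org.get("roleInOrg") in access_level:
            return org
    return None


# Hypothetical client details, shaped like a Hugging Face whoami response.
details = {"orgs": [{"name": "fedem-org", "roleInOrg": "contributor"}]}
```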

Module contents

fedem.utils.get_checkpoint_model(model_name)[source]

Get the checkpoint model based on the model name.

Parameters:

model_name (str) – Name of the model.

Returns:

Model ID if found, False otherwise.

Return type:

str | bool
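A minimal sketch of the lookup described above. The list of candidate repository IDs and the `org/model_name` naming scheme are assumptions for illustration:

```python
def get_checkpoint_model(model_name, available_models):
    """Return the first model ID whose repo name matches model_name, else False."""
    for model_id in available_models:
        if model_id.split("/")[-1] == model_name:
            return model_id
    return False


# Hypothetical repository IDs.
repos = ["fedem-org/mamba-small", "fedem-org/mamba-base"]
```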

fedem.utils.load_data(data_path)[source]

Load dataset from a given path and split it into train and validation sets.

Parameters:

data_path (str) – Path to the dataset.

Returns:

Dictionary containing train and validation datasets.

Return type:

DatasetDict

fedem.utils.load_json(json_path)[source]

Load JSON data from a file.

Parameters:

json_path (str) – Path to the JSON file.

Returns:

Loaded JSON data.

Return type:

dict
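The JSON loading step maps directly onto the standard library; a sketch of what such a helper typically looks like:

```python
import json


def load_json(json_path):
    """Read a JSON file and return the parsed object (typically a dict)."""
    with open(json_path, "r", encoding="utf-8") as f:
        return json.load(f)
```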

fedem.utils.load_model(config)[source]

Load a model based on the provided configuration.

Parameters:

config – Model configuration.

Returns:

Loaded model.

Return type:

MambaForCausalLM

fedem.utils.load_model_pretrained(config)[source]

Load a pre-trained model based on the provided configuration.

Parameters:

config – Model configuration.

Returns:

Loaded pre-trained model.

Return type:

MambaForCausalLM

fedem.utils.load_model_with_LoRA(model, target_modules, local_path)[source]

Load a model with LoRA (Low-Rank Adaptation) applied.

Parameters:
  • model – Base model to apply LoRA to.

  • target_modules – List of target modules.

  • local_path (str) – Local path to save the adapter.

Returns:

Model with LoRA applied.

Return type:

MambaForCausalLM
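LoRA replaces a frozen weight matrix W with W + (alpha/r) * B @ A, where A (r x d_in) and B (d_out x r) are small trainable matrices of rank r. The actual adapter wiring here is done on torch modules (e.g. via peft); the arithmetic behind it can be sketched dependency-free:

```python
def matmul(X, Y):
    """Naive matrix product for small illustrative matrices."""
    return [[sum(X[i][k] * Y[k][j] for k in range(len(Y)))
             for j in range(len(Y[0]))] for i in range(len(X))]


def lora_weight(W, A, B, alpha, r):
    """Return W + (alpha / r) * B @ A, the effective LoRA-adapted weight."""
    delta = matmul(B, A)          # low-rank update, d_out x d_in
    scale = alpha / r             # standard LoRA scaling factor
    return [[W[i][j] + scale * delta[i][j] for j in range(len(W[0]))]
            for i in range(len(W))]
```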

fedem.utils.load_tokenizer(path)[source]

Load tokenizer from a given path.

Parameters:

path (str) – Path to the tokenizer.

Returns:

Loaded tokenizer.

Return type:

AutoTokenizer

fedem.utils.make_config(json)[source]

Create a model configuration from the given JSON data.

Parameters:

json – JSON data containing the configuration fields.

fedem.utils.print_trainable_parameters(model)[source]

Print the number of trainable parameters in the model.

Parameters:

model – Model to print trainable parameters for.

fedem.utils.split_data(data)[source]

Split dataset into train and validation sets.

Parameters:

data (Dataset) – Dataset to split.

Returns:

Dictionary containing train and validation datasets.

Return type:

DatasetDict
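The split can be sketched with plain Python lists standing in for a datasets.Dataset; the 90/10 ratio is an assumption, as the actual fraction used by fedem is not stated here:

```python
def split_data(data, val_fraction=0.1):
    """Split data into train/validation subsets; val_fraction is an assumed ratio."""
    n_val = max(1, int(len(data) * val_fraction))
    return {"train": data[:-n_val], "validation": data[-n_val:]}
```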