
Train 3D images

Short Description

These functions train a deep learning model (a 3D autoencoder) on the selected marker(s) in the provided training data. To train with ae3dTrain (single marker) or ae3dTrainMulti (multiple markers), point the function at the dataset_dir folder that holds the training data.

ae3dTrain(dataset_dir, outModelPath, input_channels=1, output_channels=1, embedding_size=256, max_epoch_num=100, batch_size=8, num_workers=10, prefetch_factor=8)

dataset_dir (str): The file path leading to the directory that holds the training data.

outModelPath (str): Output file path for saving the trained model weights (e.g. a .pth file).

input_channels (int, optional): Number of encoder input channels (defaults to 1).

output_channels (int, optional): Number of decoder output channels (defaults to 1).

embedding_size (int, optional): Dimension of the embedding (latent code) produced by the encoder.

max_epoch_num (int, optional): Maximum number of training epochs.

batch_size (int, optional): Batch size for the DataLoader.

num_workers (int, optional): Number of subprocesses to use for data loading; 0 means the data is loaded in the main process.

prefetch_factor (int, optional): Number of batches loaded in advance by each worker.

Example:

input_channels = 1
output_channels = 1
embedding_size = 256
max_epoch_num = 100
batch_size = 8
dataset_dir = "/n/scratch/users/r/roh6824/Results/LSP13626_DNA_padding/SpatialAE/Single3DPatch/DNA1/"
outModelPath = '/n/scratch/users/r/roh6824/Results/LSP13626_DNA_padding/SpatialAE/ln_3Dautoencoder_DNA_validate_300_model_update.pth'

ae3dTrain(dataset_dir, outModelPath, input_channels, output_channels, embedding_size, max_epoch_num, batch_size)
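
After training finishes, ae3dTrain writes the model weights with torch.save(model.state_dict(), outModelPath) (see the source below). The following is a minimal sketch for reloading those weights later; it assumes LitAutoEncoder3D_Complex is importable from spatialae.models and that the same hyperparameters are passed as at training time:

```python
import torch
from spatialae.models import LitAutoEncoder3D_Complex  # assumed import path

outModelPath = '/n/scratch/users/r/roh6824/Results/LSP13626_DNA_padding/SpatialAE/ln_3Dautoencoder_DNA_validate_300_model_update.pth'

# Rebuild the architecture with the same (input_channels, output_channels, embedding_size)
# used for training, then load the state_dict saved by ae3dTrain.
model = LitAutoEncoder3D_Complex(1, 1, 256)
model.load_state_dict(torch.load(outModelPath, map_location='cpu'))
model.eval()  # switch to inference mode
```
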

Source code in spatialae/models/ae3dTrain.py
def ae3dTrain(dataset_dir,
              outModelPath,
              input_channels = 1,
              output_channels = 1, 
              embedding_size = 256,
              max_epoch_num = 100,
              batch_size = 8,
              num_workers = 10,
              prefetch_factor = 8
              ):
    """
    Parameters:
    dataset_dir (str):
        The file path leading to the directory that holds the training data.

    outModelPath (str):
        Output file path for saving the trained model weights (e.g. a .pth file).

    input_channels (int, optional):
        Number of encoder input channels (defaults to 1).

    output_channels (int, optional):
        Number of decoder output channels (defaults to 1).

    embedding_size (int, optional):
        Dimension of the embedding (latent code) produced by the encoder.

    max_epoch_num (int, optional):
        Maximum number of training epochs.

    batch_size (int, optional):
        Batch size for the DataLoader.

    num_workers (int, optional):
        Number of subprocesses to use for data loading; 0 means the data is loaded in the main process.

    prefetch_factor (int, optional):
        Number of batches loaded in advance by each worker.


    Example:
    ```python

    input_channels = 1
    output_channels = 1
    embedding_size = 256
    max_epoch_num = 100
    batch_size = 8
    dataset_dir = "/n/scratch/users/r/roh6824/Results/LSP13626_DNA_padding/SpatialAE/Single3DPatch/DNA1/"
    outModelPath = '/n/scratch/users/r/roh6824/Results/LSP13626_DNA_padding/SpatialAE/ln_3Dautoencoder_DNA_validate_300_model_update.pth'

    ae3dTrain(dataset_dir, outModelPath, input_channels, output_channels, embedding_size, max_epoch_num, batch_size)
    ```

    """
    # model = spatialae.models.LitAutoEncoder3D_update(input_channels, output_channels, embedding_size)
    model = LitAutoEncoder3D_Complex(input_channels, output_channels, embedding_size)

    # Instantiate the dataset
    transform = ToTensor3D()
    train_dataset = Spatial3DImageDataset(dataset_dir, transform=transform, get_train = True)
    train_loader = DataLoader(train_dataset, batch_size=batch_size, shuffle=True, num_workers = num_workers, prefetch_factor = prefetch_factor)

    validate_dataset = Spatial3DImageDataset(dataset_dir, transform=transform, get_validate = True)
    validate_loader = DataLoader(validate_dataset, batch_size=batch_size, shuffle=False, num_workers = num_workers, prefetch_factor = prefetch_factor)

    trainer = pl.Trainer(max_epochs=max_epoch_num)
    trainer.fit(model, train_loader, validate_loader)

    # save the trained model
    torch.save(model.state_dict(), outModelPath)

ae3dTrainMulti(dataset_dir, outModelPath, channels, embedding_size=256, max_epoch_num=100, batch_size=8, num_workers=10, prefetch_factor=8)

dataset_dir (str): The file path leading to the directory that holds the training data.

outModelPath (str): Output file path for saving the trained model weights (e.g. a .pth file).

channels (list): List of marker names to train on. Each name must match a marker subfolder under dataset_dir; pass a subset of the folder names to limit training to specific markers (e.g. ['CD3D', 'CD4']). A sketch for deriving the full list from the directory is shown after the example below.

embedding_size (int, optional): Dimension of the embedding (latent code) produced by the encoder.

max_epoch_num (int, optional): Maximum number of training epochs.

batch_size (int, optional): Batch size for the DataLoader.

num_workers (int, optional): Number of subprocesses to use for data loading; 0 means the data is loaded in the main process.

prefetch_factor (int, optional): Number of batches loaded in advance by each worker.

Example:

dataset_dir='/n/scratch/users/r/roh6824/Results/LSP13626_DNA_padding/SpatialAE/Single3DPatch/'
outModelPath='/n/scratch/users/r/roh6824/Results/LSP13626_DNA_padding/SpatialAE//ln_3dautoencoder_multi_validate_300_model_dim256_withoutDNA_add.pth'
channels = ["MART-1",  "SOX10", "S100B", "Cytokeratin (pan)","CD31", "CD206", "CD20", "CD163", "CD3E","CD8a", "CD11b", "FOXP3", "CD11c","CD103"]
embedding_size=64
max_epoch_num=100
batch_size=32
ae3dTrainMulti(dataset_dir, outModelPath, channels,  embedding_size, max_epoch_num, batch_size)
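
As noted for the channels parameter above, each marker is expected to live in its own subfolder under dataset_dir, and the folder names double as the channel list. The following small sketch (assuming that layout) derives the full list from the directory instead of typing it by hand:

```python
import os

dataset_dir = '/n/scratch/users/r/roh6824/Results/LSP13626_DNA_padding/SpatialAE/Single3DPatch/'

# Assumes one subfolder per marker under dataset_dir; the folder names
# are what ae3dTrainMulti expects in the channels argument.
channels = sorted(
    d for d in os.listdir(dataset_dir)
    if os.path.isdir(os.path.join(dataset_dir, d))
)
print(channels)  # pass this list (or a subset of it) to ae3dTrainMulti
```
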
Source code in spatialae/models/ae3dTrain.py
def ae3dTrainMulti(dataset_dir,
                   outModelPath,
                   channels,
                   embedding_size = 256,
                   max_epoch_num = 100,
                   batch_size = 8,
                   num_workers = 10,
                   prefetch_factor = 8
                   ):
    """
    Parameters:
    dataset_dir (str):
        The file path leading to the directory that holds the training data.

    outModelPath (str):
        Output file path for saving the trained model weights (e.g. a .pth file).

    channels (list):
        List of marker names to train on. Each name must match a marker subfolder under
        dataset_dir; pass a subset of the folder names to limit training to specific markers (e.g. ['CD3D', 'CD4']).

    embedding_size (int, optional):
        Dimension of the embedding (latent code) produced by the encoder.

    max_epoch_num (int, optional):
        Maximum number of training epochs.

    batch_size (int, optional):
        Batch size for the DataLoader.

    num_workers (int, optional):
        Number of subprocesses to use for data loading; 0 means the data is loaded in the main process.

    prefetch_factor (int, optional):
        Number of batches loaded in advance by each worker.

    Example:

    ```python
    dataset_dir='/n/scratch/users/r/roh6824/Results/LSP13626_DNA_padding/SpatialAE/Single3DPatch/'
    outModelPath='/n/scratch/users/r/roh6824/Results/LSP13626_DNA_padding/SpatialAE//ln_3dautoencoder_multi_validate_300_model_dim256_withoutDNA_add.pth'
    channels = ["MART-1",  "SOX10", "S100B", "Cytokeratin (pan)","CD31", "CD206", "CD20", "CD163", "CD3E","CD8a", "CD11b", "FOXP3", "CD11c","CD103"]
    embedding_size=64
    max_epoch_num=100
    batch_size=32
    ae3dTrainMulti(dataset_dir, outModelPath, channels,  embedding_size, max_epoch_num, batch_size)
    ```
    """
    input_channels = len(channels)
    output_channels = len(channels)
    # model = spatialae.models.LitAutoEncoder3D_update(input_channels, output_channels, embedding_size)
    model = LitAutoEncoder3D_Complex(input_channels, output_channels, embedding_size)

    # Instantiate the dataset
    transform = ToTensor3D()
    train_dataset = MultiChannelSpatial3DImageDataset(dataset_dir, channels, transform=transform, get_train = True)
    train_loader = DataLoader(train_dataset, batch_size=batch_size, shuffle=True,  num_workers = num_workers, prefetch_factor = prefetch_factor)

    validate_dataset = MultiChannelSpatial3DImageDataset(dataset_dir, channels, transform=transform, get_validate = True)
    validate_loader = DataLoader(validate_dataset, batch_size=batch_size, shuffle=False,  num_workers = num_workers, prefetch_factor = prefetch_factor)

    trainer = pl.Trainer(max_epochs=max_epoch_num)
    trainer.fit(model, train_loader, validate_loader)

    # save the trained model
    torch.save(model.state_dict(), outModelPath)
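
Because ae3dTrainMulti sets input_channels = output_channels = len(channels), reloading the saved weights requires rebuilding the model with the same channel count and embedding_size used during training. A minimal sketch, again assuming LitAutoEncoder3D_Complex is importable from spatialae.models:

```python
import torch
from spatialae.models import LitAutoEncoder3D_Complex  # assumed import path

channels = ["MART-1", "SOX10", "S100B", "Cytokeratin (pan)", "CD31", "CD206", "CD20",
            "CD163", "CD3E", "CD8a", "CD11b", "FOXP3", "CD11c", "CD103"]
outModelPath = '/n/scratch/users/r/roh6824/Results/LSP13626_DNA_padding/SpatialAE//ln_3dautoencoder_multi_validate_300_model_dim256_withoutDNA_add.pth'

# Same channel count and embedding_size (64 in the example above) as at training time.
model = LitAutoEncoder3D_Complex(len(channels), len(channels), 64)
model.load_state_dict(torch.load(outModelPath, map_location='cpu'))
model.eval()
```
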