Current Limitations
Please read the following carefully concerning current limitations with Batch Shipyard and Docker-enabled compute pools. If using low-priority compute nodes, please also refer to the Low Priority Compute Node Considerations document.
Azure Batch and Batch Shipyard Command Restrictions
The following Azure Batch actions should only be performed through Batch Shipyard when deploying your workload through this toolkit as Batch Shipyard needs to take special actions or ensure the intended outcome:
- Non-
native
container pools only:- Task termination (if task is running): use
jobs tasks term
- Task deletion (if task is running): use
jobs tasks del
- Job termination (if any tasks are running in the job): use the
--termtasks
option withjobs term
- Job deletion (if any tasks are running in the job): use the
--termtasks
option withjobs del
- Task termination (if task is running): use
- Pool resize: use
pool resize
- Pool deletion: use
pool del
Additionally, you cannot add Batch Shipyard tasks to a non-Batch Shipyard
allocated pool since all of the preparation for each compute node will not
be present in those pools. Please use pool add
with your pool specification
to create compute resources to execute your Batch Shipyard jobs against.
General Limitations and Restrictions
- Tasks can have a maximum lifetime of 180 days (including time spent enqueued in active state and for data movement).
- For non-
native
container pools, it is recommended to provision an SSH user to aid in client-side assisted task termination and other tasks that may require direct SSH access. - Docker container images used for multi-instance tasks (including MPI jobs)
must have
bash
installed. - SSH tunnel script generation is only compatible with non-Windows machines.
- Data movement support on Windows is restricted to scp. Both
ssh.exe
andscp.exe
must be found through%PATH%
or in the current working directory. Rsync is not supported in Windows. pool ssh
support in Windows is only available ifssh.exe
is found through%PATH%
or is in the current working directory.pool images list
currently only lists Docker images.- Credential encryption support in Windows is available only if
openssl.exe
is found through%PATH%
or is in the current working directory. - Data movement between Batch tasks as defined by
input_data
:azure_batch
is restricted to Batch accounts with keys (non-AAD). - Certain features, configuration options and usage commands are not supported with Windows compute node pools such as on-premises to compute node direct data ingress, storage cluster mounting, and credential encryption support.
- Adding tasks to the same job across multiple, concurrent Batch Shipyard invocations may result in failure if task ids for these jobs are auto-generated.
per_job_auto_scratch
andauto_scratch
are not intended to be used on autoscale pools.- The ephemeral mount point cannot be changed from the default on custom images.
- Hc/Hb VM sizes have some additional known issues which are detailed in this document.
- Encrypted Singularity container images must be pre-loaded, i.e., defined
in the
global_resources
section during Batch pool creation time.
Monitoring Limitations
- Only Linux Batch pools and RemoteFS clusters can be monitored. Windows Batch pools are not supported.