[SDK] Support Docker image as objective in the `tune` API #2326

andreyvelich · 2024-05-06T15:13:32Z

Ref discussion: kubeflow/website#3723 (comment).

Currently, user can only pass the training function as objective in the tune API in Katib Python SDK.

Similar to create_job API in Training Python SDK, we should give user an ability to set objective as Docker image.

/area sdk
/good-first-issue

The text was updated successfully, but these errors were encountered:

google-oss-prow · 2024-05-06T15:13:34Z

@andreyvelich:
This request has been marked as suitable for new contributors.

Please ensure the request meets the requirements listed here.

If this request no longer meets these requirements, the label can be removed
by commenting with the /remove-good-first-issue command.

In response to this:

Ref discussion: kubeflow/website#3723 (comment).

Currently, user can only pass the training function as objective in the tune API in Katib Python SDK.

Similar to create_job API in Training Python SDK, we should give user an ability to set objective as Docker image.

/area sdk
/good-first-issue

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository.

akhilsaivenkata · 2024-05-20T03:45:52Z

/assign

akhilsaivenkata · 2024-05-20T23:15:28Z

Hi @andreyvelich , I am new to kubeflow community and I have been going through the Ref discussion as well as code snippets of tune API and create job API. This is my understanding:

create_job API creates the job using one of the following options:
- Define custom resource object in job parameter (e.g. TFJob or PyTorchJob).
- Define training function in train_func parameter and number of workers.
- Define Docker image in base_image parameter and number of workers.
So if train_func and custom resource are not provided then method takes base_image and tries to create the job and its template without train_func.
Now we are looking to have similar functionality in tune API by giving user ability to use Docker image instead of callable function to tune the hyperparameters.
After looking at the tune API method parameters, I can see base_image as one of the parameters which is already taking Docker image as input(constants.BASE_IMAGE_TENSORFLOW image as default). So I wonder If we could make the objective parameter as an optional parameter where it takes 'None' value if no callable function is passed and make the tune API execute the steps with provided Docker image as base_image .

I would like to know whether I am on the right page or not. Please correct me If I am wrong.

andreyvelich · 2024-05-28T14:04:17Z

HI @akhilsaivenkata, yes, you are absolutely right for the TrainingClient. Also, we are planning to add target_image to the create_job in the future to build training image before job creation: kubeflow/training-operator#1878

So I wonder If we could make the objective parameter as an optional parameter where it takes 'None' value if no callable function is passed and make the tune API execute the steps with provided Docker image as base_image .

What do you think about re-using objective parameter to pass the Docker image ? In that case, we will just omit base_image and use image for Trial from objective parameter.
In the future, we can give users ability to set more parameters in objective (e.g. Git repo, tarball file).
What do you think @kubeflow/wg-training-leads @akhilsaivenkata @droctothorpe ?

akhilsaivenkata · 2024-05-28T14:43:01Z

HI @akhilsaivenkata, yes, you are absolutely right for the TrainingClient. Also, we are planning to add target_image to the create_job in the future to build training image before job creation: kubeflow/training-operator#1878

So I wonder If we could make the objective parameter as an optional parameter where it takes 'None' value if no callable function is passed and make the tune API execute the steps with provided Docker image as base_image .

What do you think about re-using objective parameter to pass the Docker image ? In that case, we will just omit base_image and use image for Trial from objective parameter. In the future, we can give users ability to set more parameters in objective (e.g. Git repo, tarball file). What do you think @kubeflow/wg-training-leads @akhilsaivenkata @droctothorpe ?

Thank you so much for your review @andreyvelich . If we have plans to give users the ability to set more parameters then I believe it would definitely be better option to go with your approach. If everyone is positive with this plan then I can proceed with implementation.

github-actions · 2024-08-26T15:05:15Z

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions.

andreyvelich · 2024-08-26T15:06:57Z

/remove-lifecycle stale

google-oss-prow bot added area/sdk good first issue Good for newcomers help wanted Extra attention is needed labels May 6, 2024

andreyvelich mentioned this issue May 6, 2024

Katib: Reorganized Katib Docs kubeflow/website#3723

Merged

google-oss-prow bot assigned akhilsaivenkata May 20, 2024

akhilsaivenkata linked a pull request May 30, 2024 that will close this issue

[SDK]Support Docker image as objective in the tune API #2338

Open

1 task

github-actions bot added the lifecycle/stale label Aug 26, 2024

google-oss-prow bot removed the lifecycle/stale label Aug 26, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[SDK] Support Docker image as objective in the `tune` API #2326

[SDK] Support Docker image as objective in the `tune` API #2326

andreyvelich commented May 6, 2024

google-oss-prow bot commented May 6, 2024

akhilsaivenkata commented May 20, 2024

akhilsaivenkata commented May 20, 2024

andreyvelich commented May 28, 2024

akhilsaivenkata commented May 28, 2024

github-actions bot commented Aug 26, 2024

andreyvelich commented Aug 26, 2024

[SDK] Support Docker image as objective in the tune API #2326

[SDK] Support Docker image as objective in the tune API #2326

Comments

andreyvelich commented May 6, 2024

google-oss-prow bot commented May 6, 2024

akhilsaivenkata commented May 20, 2024

akhilsaivenkata commented May 20, 2024

andreyvelich commented May 28, 2024

akhilsaivenkata commented May 28, 2024

github-actions bot commented Aug 26, 2024

andreyvelich commented Aug 26, 2024

[SDK] Support Docker image as objective in the `tune` API #2326

[SDK] Support Docker image as objective in the `tune` API #2326