Job Manager Service - Samples
Running a Model
Input Parameters
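modelId: Defines the ID of the model. The model must be accessible to the tenant through the Model Management Service.
Supported models
Currently, only Apache Zeppelin notebooks are supported.
configurationId: Defines the environment configuration that was created using Predictive Learning Services. The Job Manager instantiates the environment and runs the model in it.
inputFolderId: Defines the folder that stores the input data. The folder must be accessible to the tenant through the Data Exchange Service.
outputFolderId: Defines the output folder. The folder must already exist and be accessible to the tenant through the Data Exchange Service.
maximumExecutionTimeInSeconds: Defines the maximum permitted execution time of the job. Only the actual execution time of the execution engine (Apache Zeppelin) is counted.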
Starting a Job
Request:
POST /api/jobmanager/v3/jobs
X-XSRF-TOKEN: `<xsrf_token>`
The input parameters are defined in the request body:
{
"modelId": "<modelId>",
"configurationId": "<configurationId>",
"inputFolderId": "<inputFolderId>",
"outputFolderId": "<outputFolderId>",
"maximumExecutionTimeInSeconds":"7200"
}
Sample response:
{
"id": "<job_execution_id>",
"modelId": "<modelId>",
"environmentId": "<environmentId>",
"message": "",
"status":"SUBMITTED",
"creationDate":"2018-10-01T12:00:00.001Z",
"createdBy": "<your_tenant_id>",
"inputFolderId": "<inputFolderId>",
"outputFolderId": "<outputFolderId>",
"configurationId": "<configurationId>",
"maximumExecutionTimeInSeconds":"7200"
}
The <job_execution_id> is required to monitor the job with the Job Manager.
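As a rough illustration of the request above, the following Python sketch builds the POST request and reads the job execution ID from the response. It assumes a gateway base URL supplied by the caller and uses only the X-XSRF-TOKEN header shown in the sample; any additional authentication your environment needs is not covered here. The function names build_start_job_request and job_id_from_response are hypothetical helpers, not part of the service.

```python
import json
import urllib.request


def build_start_job_request(base_url, xsrf_token, model_id, configuration_id,
                            input_folder_id, output_folder_id, max_seconds=7200):
    """Build (but do not send) the POST request that starts a job.

    base_url is an assumption: the gateway root of your environment.
    """
    body = {
        "modelId": model_id,
        "configurationId": configuration_id,
        "inputFolderId": input_folder_id,
        "outputFolderId": output_folder_id,
        # The sample request passes this value as a string, so do the same.
        "maximumExecutionTimeInSeconds": str(max_seconds),
    }
    return urllib.request.Request(
        url=base_url.rstrip("/") + "/api/jobmanager/v3/jobs",
        data=json.dumps(body).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            "X-XSRF-TOKEN": xsrf_token,
        },
        method="POST",
    )


def job_id_from_response(response_text):
    """Return the job execution ID ("id" field) from the start-job response."""
    return json.loads(response_text)["id"]
```

The request object could then be sent with urllib.request.urlopen(request), and the decoded response body passed to job_id_from_response.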
Monitoring Job Execution
Use the following request to query the current status of a job:
GET /api/jobmanager/v3/jobs/<job_execution_id>
Sample response:
{
"id": "<job_execution_id>",
"modelId": "<modelId>",
"environmentId": "<environmentId>",
"message":"Started notebook execution.",
"status":"RUNNING",
"creationDate":"2018-10-01T02:00:00.001Z",
"createdBy": "<your_tenant_id>",
"inputFolderId": "<inputFolderId>",
"outputFolderId": "<outputFolderId>",
"configurationId": "<configurationId>",
"maximumExecutionTimeInSeconds":"7200"
}
The following statuses are available:
SUBMITTED
STARTING
RUNNING
STOPPING
FAILED
SUCCEEDED
STOPPED
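If the job execution cannot be completed or an error occurs during execution, the status is FAILED. This status is also reported if a single paragraph of the Zeppelin notebook fails while all other steps succeed. If the execution flow allows a single step to be skipped, the Job Manager tries to continue the job execution.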
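Monitoring usually means repeating the status request until the job stops changing state. The sketch below shows one way to do that in Python. It treats SUCCEEDED, FAILED and STOPPED as final statuses, which is an assumption based on their names rather than something the service documentation states. The fetch_job argument is a hypothetical caller-supplied function that performs the GET request and returns the parsed JSON as a dictionary.

```python
import time

# Assumption: these statuses are final; all others mean the job is still changing.
TERMINAL_STATUSES = {"SUCCEEDED", "FAILED", "STOPPED"}


def wait_for_job(fetch_job, poll_seconds=30.0, max_polls=240, sleep=time.sleep):
    """Poll a job until it reaches a terminal status or max_polls is used up.

    fetch_job: hypothetical callable returning the parsed response of
               GET /api/jobmanager/v3/jobs/<job_execution_id> as a dict.
    Returns the last job dictionary that was fetched.
    """
    job = fetch_job()
    polls = 1
    while job["status"] not in TERMINAL_STATUSES and polls < max_polls:
        sleep(poll_seconds)  # wait before asking again
        job = fetch_job()
        polls += 1
    return job
```

The caller should still check the returned status, because the loop also ends when max_polls is exhausted while the job is still running.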
Retrieving the Job List
Use the following request to retrieve the list of all available jobs:
GET /api/jobmanager/v3/jobs
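By default, the Job Manager splits the result into pages of 100 entries each and returns the first page. Use the query parameter <pageNumber> to change which page is returned, and the query parameter <pageSize> to change the number of entries per page.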
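The paging parameters can be appended to the list request as an ordinary query string. The following Python sketch assumes the literal parameter names pageNumber and pageSize and a caller-supplied gateway base URL; build_job_list_url is a hypothetical helper. Whether page numbering starts at 0 or 1 is not stated here, so the sketch passes the value through unchanged.

```python
from urllib.parse import urlencode


def build_job_list_url(base_url, page_number=None, page_size=None):
    """Return the job list URL, adding paging parameters only when given."""
    params = {}
    if page_number is not None:
        params["pageNumber"] = page_number  # assumed parameter name
    if page_size is not None:
        params["pageSize"] = page_size  # assumed parameter name
    url = base_url.rstrip("/") + "/api/jobmanager/v3/jobs"
    return (url + "?" + urlencode(params)) if params else url
```

Omitting both arguments leaves the service defaults in place.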
Response:
{
"content": [
{
"jobId": "<job_execution_id>",
"modelId": "<modelId>",
"environmentId": "<environmentId>",
"message": "message",
"status":"SUCCEEDED",
"creationDate":"2018-10-01T12:00:00.001Z",
"createdBy":"TenantId",
"inputFolderId": "<inputFolderId>",
"outputFolderId": "<outputFolderId>",
"configurationId": "<configurationId>",
"maximumExecutionTimeInSeconds":"7200"
},
{
"jobId": "<job_execution_id>",
"modelId": "<modelId>",
"environmentId": "<environmentId>",
"message": "Unable to import model into Zeppelin[I/O error on GET request for \"https://gateway.{region}.{mindsphere-domain}/api/modelmanagement/v3/models/<model_id>/versions/last\": Server returned HTTP response code: 504 for URL: https://gateway.{region}.{mindsphere-domain}/api/modelmanagement/v3/models/<model_id>/versions/last; nested exception is java.io.IOException: Server returned HTTP response code: 504 for URL: https://gateway.{region}.{mindsphere-domain}/api/modelmanagement/v3/models/Model_id/versions/last] Error while stopping environment",
"status":"STOPPING",
"creationDate":"2018-10-01T12:00:00.001Z",
"createdBy":"TenantId",
"inputFolderId": "<inputFolderId>",
"outputFolderId": "<outputFolderId>",
"configurationId": "<configurationId>",
"maximumExecutionTimeInSeconds":"7200"
},
{
"jobId": "<job_execution_id>",
"modelId": "<modelId>",
"environmentId": "<environmentId>",
"message":"Failed to start environment [An environment has already been started for the configuration]",
"status":"FAILED",
"creationDate":"2018-10-02T12:00:00.001Z",
"createdBy":"TenantId",
"inputFolderId": "<inputFolderId>",
"outputFolderId": "<outputFolderId>",
"configurationId": "<configurationId>",
"maximumExecutionTimeInSeconds":"7200"
}
],
"totalPages":1,
"totalElements":3,
"last":true,
"size":20,
"number":0,
"numberOfElements":3,
"first":true,
"sort":null
}
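The second and third entries show that, when a fatal error occurs and the execution logic cannot recover, the chain of messages is returned to the user. The Job Manager tries to stop the instance in use while keeping any output that may already have been produced.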
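To pick such entries out of a list response, a client can filter the content array by status and read the message field. The Python sketch below does this for a page that has already been parsed into a dictionary. The choice of FAILED and STOPPING as the default statuses simply mirrors the two entries in the sample and is an assumption; problem_jobs is a hypothetical helper. Note that entries in the list response carry the identifier in jobId, whereas the single-job response uses id.

```python
def problem_jobs(page, statuses=("FAILED", "STOPPING")):
    """Return (jobId, status, message) tuples for jobs in the given statuses.

    page: the parsed JSON of GET /api/jobmanager/v3/jobs as a dict.
    """
    return [
        (job["jobId"], job["status"], job["message"])
        for job in page["content"]
        if job["status"] in statuses
    ]
```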
Except where otherwise noted, content on this site is licensed under the MindSphere Development License Agreement.
Last update: May 31, 2019