DescribeModelServiceGroups

Last updated: 2025-11-13 20:53:05

1. API Description

Domain name for API request: tione.intl.tencentcloudapi.com.

This API is used to list online inference service groups.

A maximum of 20 requests can be initiated per second for this API.
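
A client that issues many calls in a loop can stay under this limit by spacing its requests. The following is a minimal sketch, not part of any official SDK: the class name `RequestThrottle` and its parameters are hypothetical, and it enforces the limit only within a single process.

```python
import time


class RequestThrottle:
    """Spaces calls at least 1/max_per_second seconds apart (single process only)."""

    def __init__(self, max_per_second=20, clock=time.monotonic, sleep=time.sleep):
        self.min_interval = 1.0 / max_per_second
        self._clock = clock      # injectable so the class can be tested without waiting
        self._sleep = sleep
        self._next_allowed = None

    def wait(self):
        """Block until the next request may be sent; return the seconds waited."""
        now = self._clock()
        if self._next_allowed is None or now >= self._next_allowed:
            waited = 0.0
        else:
            waited = self._next_allowed - now
            self._sleep(waited)
            now = self._next_allowed
        # The next call may start no earlier than one interval after this one.
        self._next_allowed = now + self.min_interval
        return waited
```

Call `wait()` immediately before each request. The server-side limit still applies, so a throttled client should also be prepared to handle a rate-limit error response.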

We recommend using API Explorer.
API Explorer provides a range of capabilities, including online call, signature authentication, SDK code generation, and API quick search. It enables you to view the request, response, and auto-generated examples.

2. Input Parameters

The list below covers the request parameters specific to this API and some of the common parameters. For the complete list of common parameters, see Common Request Parameters.

| Parameter Name | Required | Type | Description |
|---|---|---|---|
| Action | Yes | String | Common parameter. The value for this API is DescribeModelServiceGroups. |
| Version | Yes | String | Common parameter. The value for this API is 2021-11-11. |
| Region | Yes | String | Common parameter. For details, see the list of regions supported by the product. |
| Offset | No | Integer | Offset. Default value: 0. |
| Limit | No | Integer | Number of returned results. Default value: 20. Maximum value: 100. |
| Order | No | String | Sort order of the output list. Valid values: ASC (ascending order) and DESC (descending order). |
| OrderField | No | String | Field to sort by. Valid values: CreateTime and UpdateTime. |
| Filters.N | No | Array of Filter | Filter parameters. Supported filterable field names: ClusterId, ServiceId, ServiceGroupName, ServiceGroupId, Status, CreatedBy, ModelVersionId. |
| TagFilters.N | No | Array of TagFilter | Tag filtering parameters. |
| ServiceCategory | No | String | Service classification. |
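
Because Limit is capped at 100, listing every service group may take several calls with an increasing Offset. The sketch below shows one way to page through the results. It assumes a hypothetical callable `call_api` that accepts the request parameters as a dict and returns the decoded `Response` object; how that callable signs and sends the request is outside the scope of this example.

```python
def iter_service_groups(call_api, limit=100, **extra_params):
    """Yield every service group, fetching pages of at most `limit` items.

    `call_api` is a hypothetical helper: it takes a dict of request
    parameters and returns the decoded "Response" object as a dict.
    """
    if not 1 <= limit <= 100:
        raise ValueError("Limit must be between 1 and 100")
    offset = 0
    while True:
        params = {"Offset": offset, "Limit": limit, **extra_params}
        response = call_api(params)
        groups = response.get("ServiceGroups") or []  # the field may be null
        yield from groups
        offset += len(groups)
        total = response.get("TotalCount") or 0       # the field may be null
        # Stop on an empty page or once every reported group has been seen.
        if not groups or offset >= total:
            break
```

Sorting parameters can be passed through as keyword arguments, for example `iter_service_groups(call_api, Order="DESC", OrderField="CreateTime")`.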

3. Output Parameters

| Parameter Name | Type | Description |
|---|---|---|
| TotalCount | Integer | Number of inference service groups. Note: This field may return null, indicating that no valid values can be obtained. |
| ServiceGroups | Array of ServiceGroup | Service group information. Note: This field may return null, indicating that no valid values can be obtained. |
| RequestId | String | Unique request ID generated by the server and returned with every request. If the request never reaches the server, no RequestId is returned. Provide the RequestId when troubleshooting a problem. |

4. Example

Example 1: Listing All Service Groups

Input Example

POST / HTTP/1.1
Host: tione.intl.tencentcloudapi.com
Content-Type: application/json
X-TC-Action: DescribeModelServiceGroups
<Common request parameters>

{
    "Limit": 10,
    "Offset": 0,
    "Filters": []
}
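
The request above can be assembled programmatically. The sketch below builds the URL, headers, and JSON body for this call using only the standard library. It deliberately leaves out the `Authorization` header: signing is covered under Common Request Parameters and is not reproduced here. The function name `build_unsigned_request` is hypothetical, and the `X-TC-Version`, `X-TC-Region`, and `X-TC-Timestamp` headers are assumed to be the header forms of the corresponding common parameters.

```python
import json
import time

ENDPOINT_HOST = "tione.intl.tencentcloudapi.com"  # from the API description
API_ACTION = "DescribeModelServiceGroups"
API_VERSION = "2021-11-11"


def build_unsigned_request(region, payload, timestamp=None):
    """Return (url, headers, body) for the call, WITHOUT an Authorization header."""
    body = json.dumps(payload, separators=(",", ":"))
    headers = {
        "Host": ENDPOINT_HOST,
        "Content-Type": "application/json",
        "X-TC-Action": API_ACTION,
        "X-TC-Version": API_VERSION,
        "X-TC-Region": region,
        # Current Unix time unless the caller supplies a fixed value.
        "X-TC-Timestamp": str(int(time.time()) if timestamp is None else timestamp),
    }
    return "https://" + ENDPOINT_HOST + "/", headers, body


url, headers, body = build_unsigned_request(
    "ap-guangzhou", {"Limit": 10, "Offset": 0, "Filters": []}, timestamp=1700000000
)
```

A request built this way will be rejected until a valid signature is added, so in practice an SDK or API Explorer is usually the simpler route.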

Output Example

{
    "Response": {
        "ServiceGroups": [
            {
                "ServiceGroupId": "ms-vzd8qdk5",
                "ServiceGroupName": "llm_test",
                "CreatedBy": "100032054859",
                "CreateTime": "2023-08-05T05:46:58Z",
                "UpdateTime": "2023-08-08T14:27:02Z",
                "Uin": "100005348929",
                "ServiceCount": 1,
                "RunningServiceCount": 0,
                "Services": [
                    {
                        "ServiceGroupId": "ms-vzd8qdk5",
                        "ServiceId": "ms-vzd8qdk5-1",
                        "ServiceGroupName": "llm_test",
                        "ServiceDescription": "",
                        "ServiceInfo": {
                            "Replicas": 0,
                            "ImageInfo": {
                                "ImageType": "PRE_SET",
                                "ImageUrl": "tione.tencentcloudcr.com/qcloud-ti-platform/chatglm-demo:latest",
                                "RegistryRegion": "",
                                "RegistryId": ""
                            },
                            "Env": [],
                            "Resources": {
                                "Cpu": 12000,
                                "Memory": 45056,
                                "Gpu": 100,
                                "RealGpu": 100,
                                "GpuType": "A10",
                                "RealGpuDetailSet": [
                                    {
                                        "Name": "A10",
                                        "Value": 100
                                    }
                                ]
                            },
                            "InstanceType": "TI.GNV4.3XLARGE44.POST",
                            "ModelInfo": {
                                "ModelId": "m-838176671237008128",
                                "ModelName": "chatglm-6b-tiacc-ft",
                                "ModelVersionId": "mv-v1-838176671237008129",
                                "ModelVersion": "v1",
                                "ModelSource": "COS",
                                "ModelType": "NORMAL",
                                "CosPathInfo": {
                                    "Bucket": "danerli-guangzhou-1256580188",
                                    "Region": "ap-guangzhou",
                                    "Paths": [
                                        "output/ai_market_algo_test_thu_tiacc_ft/train-793371624977165056/output/adgen-chatglm-6b-ft-1e-8/checkpoint-500/model_service.py"
                                    ]
                                },
                                "AlgorithmFramework": "PYTORCH",
                                "ModelFormat": "PYTORCH"
                            },
                            "InferCodeInfo": {
                                "CosPathInfo": {
                                    "Bucket": "danerli-guangzhou-1256580188",
                                    "Region": "ap-guangzhou",
                                    "Paths": [
                                        "output/ai_market_algo_test_thu_tiacc_ft/train-793371624977165056/output/adgen-chatglm-6b-ft-1e-8/checkpoint-500/model_service.py"
                                    ]
                                }
                            },
                            "VolumeMount": {
                                "VolumeSourceType": "CFS",
                                "CFSConfig": {
                                    "Id": "cfs-mpjk7vit",
                                    "Path": "/",
                                    "MountType": "",
                                    "Protocol": ""
                                }
                            },
                            "LogEnable": false,
                            "LogConfig": null,
                            "AuthorizationEnable": false,
                            "ScaleMode": "",
                            "HorizontalPodAutoscaler": null,
                            "CronScaleJobs": [],
                            "ScaleStrategy": "",
                            "ScheduledAction": null,
                            "Status": {
                                "Replicas": 0,
                                "UpdatedReplicas": 0,
                                "ReadyReplicas": 0,
                                "AvailableReplicas": 0,
                                "UnavailableReplicas": 0,
                                "Status": "Stopped",
                                "Reason": "",
                                "Conditions": [
                                    {
                                        "Message": "Deployment has minimum availability.",
                                        "Reason": "MinimumReplicasAvailable",
                                        "Status": "True",
                                        "Type": "Available",
                                        "LastTransitionTime": "2023-08-08T22:27:01+08:00",
                                        "LastUpdateTime": "2023-08-08T22:27:01+08:00"
                                    },
                                    {
                                        "Message": "ReplicaSet \"ms-vzd8qdk5-1-7456b4cc97\" has successfully progressed.",
                                        "Reason": "NewReplicaSetAvailable",
                                        "Status": "True",
                                        "Type": "Progressing",
                                        "LastTransitionTime": "2023-08-08T22:27:02+08:00",
                                        "LastUpdateTime": "2023-08-08T22:27:02+08:00"
                                    }
                                ]
                            },
                            "Weight": 0,
                            "PodList": [],
                            "Pods": null,
                            "PodInfos": [],
                            "ResourceTotal": null,
                            "OldReplicas": 1,
                            "HybridBillingPrepaidReplicas": 0,
                            "OldHybridBillingPrepaidReplicas": 0,
                            "ServiceLimit": null,
                            "ModelHotUpdateEnable": false,
                            "ModelTurboEnable": false
                        },
                        "ClusterId": "",
                        "Region": "ap-guangzhou",
                        "Namespace": "infer-100005348929",
                        "ChargeType": "POSTPAID_BY_HOUR",
                        "ResourceGroupId": "",
                        "ResourceGroupName": "",
                        "Tags": [],
                        "IngressName": "user-ingress-1",
                        "CreatedBy": "100032054859",
                        "CreateTime": "2023-08-05T05:46:58Z",
                        "UpdateTime": "2023-08-08T14:27:02Z",
                        "Uin": "100005348929",
                        "SubUin": "",
                        "AppId": 0,
                        "BusinessStatus": "CREATE_SUCCEED",
                        "CreateFailedReason": "CREATE_SUCCEED",
                        "Status": "Stopped",
                        "BillingInfo": "",
                        "Weight": 100,
                        "CreateSource": "DEFAULT",
                        "Version": "1",
                        "LatestVersion": "",
                        "ServiceLimit": null,
                        "ScheduledAction": null
                    }
                ],
                "Status": "Stopped",
                "Tags": [],
                "BusinessStatus": "",
                "WeightUpdateStatus": "",
                "CreateSource": "DEFAULT",
                "BillingInfo": "",
                "LatestVersion": ""
            }
        ],
        "TotalCount": 1,
        "RequestId": "e5daa146-abed-4d31-ab15-ac9e251fc13c"
    }
}
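
A response of this shape is deeply nested, so callers often reduce it to the few fields they need. The sketch below is one illustrative way to do that. The function name `summarize_service_groups` and the keys of the returned dicts are hypothetical; the field names it reads are taken from the output example above.

```python
def summarize_service_groups(body):
    """Return one small dict per service group found in a decoded response body."""
    response = body.get("Response") or {}
    groups = response.get("ServiceGroups") or []  # the field may be null
    summary = []
    for group in groups:
        services = group.get("Services") or []
        summary.append({
            "id": group.get("ServiceGroupId"),
            "name": group.get("ServiceGroupName"),
            "status": group.get("Status"),
            "service_ids": [service.get("ServiceId") for service in services],
            "running": group.get("RunningServiceCount") or 0,
            "total": group.get("ServiceCount") or 0,
        })
    return summary


# A trimmed copy of the output example, keeping only the fields read above.
sample = {
    "Response": {
        "ServiceGroups": [
            {
                "ServiceGroupId": "ms-vzd8qdk5",
                "ServiceGroupName": "llm_test",
                "ServiceCount": 1,
                "RunningServiceCount": 0,
                "Services": [{"ServiceId": "ms-vzd8qdk5-1"}],
                "Status": "Stopped",
            }
        ],
        "TotalCount": 1,
        "RequestId": "e5daa146-abed-4d31-ab15-ac9e251fc13c",
    }
}
rows = summarize_service_groups(sample)
```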

5. Developer Resources

SDK

TencentCloud API 3.0 integrates SDKs that support various programming languages to make it easier for you to call APIs.

Command Line Interface

6. Error Codes

Only the error codes related to this API's business logic are listed below. For other error codes, see Common Error Codes.

| Error Code | Description |
|---|---|
| AuthFailure | CAM signature/authentication error. |
| AuthFailure.CamException | Cloud Access Management (CAM) system exception. |
| AuthFailure.UnauthorizedOperation | Unauthorized operation. |
| FailedOperation | Operation failed. |
| FailedOperation.ApiGatewayQueryFailed | Failed to access the API Gateway. Please try again. |
| FailedOperation.BillingQueryFailed | Failed to access the billing module. |
| FailedOperation.ClusterQueryFailed | Cluster access failed. |
| FailedOperation.DuplicateName | Duplicate name. |
| FailedOperation.ExecDatabaseFail | Database execution error. |
| FailedOperation.ExecTagFail | Tag operation failed. |
| FailedOperation.NoPermission | No permissions. |
| FailedOperation.NotAllow | The operation is not allowed. |
| FailedOperation.QueryDatabaseFail | Database query error. |
| FailedOperation.QueryTagFail | Failed to query the tag service. |
| FailedOperation.StsQueryFailed | Failed to access the key service. Please try again. |
| FailedOperation.UnknownInstanceType | Unknown instance specifications. |
| FailedOperation.UnmarshalData | Data parsing failed. |
| InternalError | Internal error. |
| InvalidParameter | Parameter error. |
| InvalidParameterValue | Parameter value error. |
| InvalidParameterValue.BadName | Invalid name. |
| MissingParameter | A required parameter is missing. |
| OperationDenied | Operation denied. |
| OperationDenied.BalanceInsufficient | Insufficient balance. Failed to create or update. |
| ResourceNotFound | The resource does not exist. |
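
Two of the codes above carry a "Please try again" note, which suggests that a client may retry them. The sketch below separates those from the rest. It assumes that a failed call returns an `Error` object with `Code` and `Message` fields inside `Response`; that layout is not shown on this page, so check it against Common Error Codes before relying on it. The function names are hypothetical.

```python
# Codes whose descriptions in the table above say "Please try again".
RETRYABLE_CODES = frozenset({
    "FailedOperation.ApiGatewayQueryFailed",
    "FailedOperation.StsQueryFailed",
})


def extract_error(body):
    """Return a dict describing the error in a decoded response body, or None.

    Assumes the error sits at body["Response"]["Error"] with "Code" and
    "Message" fields (an assumption, not confirmed by this page).
    """
    response = body.get("Response") or {}
    error = response.get("Error")
    if not error:
        return None
    return {
        "code": error.get("Code", ""),
        "message": error.get("Message", ""),
        "request_id": response.get("RequestId"),
    }


def should_retry(error):
    """True only for the codes that this page marks as worth retrying."""
    return error is not None and error["code"] in RETRYABLE_CODES
```

Keeping the `request_id` alongside the code makes it easier to report a failure, since the RequestId is needed to locate a problem.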
