Yui AI

API Documentation (Ollama-like)

List Models: GET /api/tags

Returns a list of available models.

Example Response:

{
  "models": [
    {
      "name": "0727-360B-API",
      "model": "0727-360B-API",
      "digest": "",
      "size": 0,
      "modified_at": "2025-09-29T12:00:00.000Z",
      "details": {
        "format": "",
        "family": "GLM",
        "families": [
          "GLM"
        ],
        "parameter_size": "",
        "quantization_level": ""
      }
    },
    {
      "name": "glm-4.5v",
      "model": "glm-4.5v",
      "digest": "",
      "size": 0,
      "modified_at": "2025-09-29T12:00:00.000Z",
      "details": {
        "format": "",
        "family": "GLM",
        "families": [
          "GLM"
        ],
        "parameter_size": "",
        "quantization_level": ""
      }
    }
  ]
}

Chat Completion: POST /api/chat

Sends a chat message and receives a streaming AI response.

Request Body:

{
  "model": "<selected_model>",
  "messages": [
    { "role": "user", "content": "Your message here" },
    { "role": "assistant", "content": "AI response here" }
  ],
  "stream": true, // Always true for streaming response
  "options": {
    "webSearch": false, // boolean
    "enableThinking": false // boolean
  }
}

Example Request:

Example Streaming Response (application/x-ndjson):

{
  "model": "0727-360B-API",
  "created_at": "2025-09-29T12:00:00.000Z",
  "message": {
    "role": "assistant",
    "content": "Halo! "
  },
  "done": false
}
{
  "model": "0727-360B-API",
  "created_at": "2025-09-29T12:00:00.000Z",
  "message": {
    "role": "assistant",
    "content": "Saya adalah "
  },
  "done": false
}
...
{
  "model": "0727-360B-API",
  "created_at": "2025-09-29T12:00:00.000Z",
  "done": true
}