Create conversation

Available in VPC

LLM generates interactive sentences for user queries.

Request

This section describes the request format. The method and URI are as follows:

Method	URI
POST	/api/v1/svc/{serviceId}/conversation

Request headers

For information about the headers common to all RAG APIs, see RAG request headers.

Request path parameters

The following describes the parameters.

Field	Type	Required	Description
`serviceId`	String	Required	Service's unique identifier

Request body

You can include the following data in the body of your request:

Field	Type	Required	Description
`serviceId`	String	Required	Service's unique identifier
`messages`	Array	Required	Message list Array of message objects representing the conversation history
`blocks`	Object	Optional	Configuration object with the name of each block as key (parameters to override default settings) Block name `RETRIEVAL` (document search) \| `GENERATOR` (answer generation)
`blocks.{blockName}.type`	String	Optional	Block type `RETRIEVAL` \| `LLM` For a `RETRIEVAL` block, enter `RETRIEVAL`. For a `GENERATOR` block, enter `LLM`.
`blocks.{blockName}.parameters`	Object	Optional	Detailed configuration parameters for each block RETRIEVAL parameter \| GENERATOR parameter
`citation`	Object	Optional	Citation settings
`citation.enabled`	Boolean	Optional	Whether to display citations in model responses `true` (default) \| `false` `true`: Display citations. `false`: Do not display citations.
`verbose`	Boolean	Optional	Whether to output detailed results by block `true` \| `false` (default) `true`: Display all blocks. `false`: Only display `GENERATOR` blocks.

`messages`

This is an array of messages that provide context for the conversation.

Field	Type	Required	Description
`role`	String	Required	Message role `user` \| `assistant` `user`: User query `assistant`: Model response
`content`	String	Required	Message content

Note

The last element of the array must be role = "user", and the last message will be used as the user's query.

RETRIEVAL parameter

These are configuration parameters for the document search block (blocks.RETRIEVAL.type = "RETRIEVAL").

Field	Type	Required	Description
`k`	Integer	Optional	Number of documents to search Default: 5
`fileName`	String	Optional	File name to search for
`chunkSize`	Integer	Optional	Text chunk size 256-4096 (default: 512)
`overlapSize`	Integer	Optional	Overlapping text size between chunks 0 or more, less than half of `chunkSize` (default: 128)
`filters`	Array	Optional	Filter conditions that limit the search scope

filters

Field	Type	Required	Description
`term`	Object	Optional	Filter documents by exact match values. Field name: `extra.bucket`, `extra.key`, `extra.last_modified`, `extra.size`, `extra.{userField}`, `file_name` Prepend `.keyword` for explicit use when searching for the `extra` field and enter the search value in `value` (string type).
`range`	Object	Optional	Filter documents by number/date range. Field name: `extra.last_modified`, `extra.size`, `extra.{userField}` Operator: `gte` (equal to or greater than), `gt` (greater than), `lte` (equal to or less than), `lt` (less than) Date format: YYYY-MM-DD, file size unit: byte

GENERATOR parameter

These are configuration parameters for the answer generation block (blocks.GENERATOR.type = "LLM").

Field	Type	Required	Description
`chatModel`	String	Optional	LLM model `HyperClovaX` `OpenAI` `Anthropic`
`systemPrompt`	String	Optional	System directive
`llmParameters`	Object	Optional	Detailed configuration parameters for LLM
`llmParameters.model`	String	Optional	LLM model name `HyperClovaX`: `HCX-003` \| `HCX-005` \| `HCX-007` \| `HCX-DASH-001` \| `HCX-DASH-002` \| ... `OpenAI`: `gpt-5` \| `gpt-5-mini` \| ... `Anthropic`: `claude-sonnet-4-5` \| `claude-haiku-4-5` \| `claude-opus-4-1` \| ...
`llmParameters.clovaStudioApiKey`	String	Optional	CLOVA Studio API authentication key See the API key issuance guide.
`llmParameters.temperature`	Float	Optional	Degree of diversity for the generated tokens 0.00-1.00 (default: 0.50) Higher values generate more diverse sentences.
`llmParameters.topK`	Integer	Optional	Sample K high-probability candidates from the pool of generated token candidates. 0-128 (default: 0)
`llmParameters.topP`	Float	Optional	Sample generated token candidates based on cumulative probability. 0.00-1.00 (default: 0.80) It is not used when `chatModel` = `Anthropic`.
`llmParameters.maxTokens`	Integer	Optional	Maximum number of generated tokens 0-2048 (default: 256)

Request example

The request example is as follows:

curl --location --request POST 'https://kr-pub-gateway.rag.naverncp.com/api/v1/svc/{serviceId}/conversation' \
--header 'Authorization: Bearer {apiKey}' \
--header 'Content-Type: application/json' \
--header 'Accept: text/event-stream' \
--data '{
  "serviceId": "{serviceId}",
  "messages": [
    {
      "role": "user",
      "content": "I'm curious about NAVER's technology."
    },
    {
      "role": "assistant",
      "content": "NAVER has a special technology that can digitally recreate the look of a city, so photos taken by drones can be shown in three dimensions on your computer as if they were real!"
    },
    {
      "role": "user",
      "content": "What do people call this technology?"
    }
  ],
  "blocks": {
    "RETRIEVAL": {
      "type": "RETRIEVAL",
      "parameters": {
        "k": 1,
        "fileName": "leap-25.docx",
        "chunkSize": 256,
        "overlapSize": 128,
        "filters": [
          {
            "term": {
              "extra.bucket.keyword": {
                "value": "clova-ai"
              }
            }
          },
          {
            "term": {
              "file_name": "leap-25.docx"
            }
          },
          {
            "range": {
              "extra.size": {
                "lte": 1000000
              }
            }
          }
        ]
      }
    },
    "GENERATOR": {
      "type": "LLM",
      "parameters": {
        "chatModel": "HyperClovaX",
        "systemPrompt": "Answer at the level of an elementary school student.",
        "llmParameters": {
          "model": "HCX-005",
          "clovaStudioApiKey": "{clovaStudioApiKey}",
          "temperature": 0.50,
          "topK": 64,
          "topP": 0.50,
          "maxTokens": 1024
        }
      }
    }
  },
  "citation": {
    "enabled": true
  },
  "verbose": true
}

Response

This section describes the response format.

Response headers

The following describes the response headers.

Headers	Required	Description
`Content-Type`	-	Response data format `application/json`

Response body

The response body includes the following data:

Field	Type	Required	Description
`code`	Integer	-	API response status code
`message`	String	-	API response status message
`data`	Object	-	API processing result
`data.results`	Array	-	RAG processing step-by-step result
`data.references`	Array	-	Citation information
`data.rateLimit`	Object	-	API usage limit information
`data.rateLimit.limitRequests`	String	-	Maximum number of requests
`data.rateLimit.remainingRequests`	String	-	Remaining number of requests
`data.rateLimit.resetRequests`	String	-	Request limit reset time (second)
`data.rateLimit.limitTokens`	String	-	Maximum number of tokens
`data.rateLimit.remainingTokens`	String	-	Remaining number of tokens
`data.rateLimit.resetTokens`	String	-	Token limit reset time (second)

`data.results`

Each element in the array represents a sequential RAG processing step.

Field	Type	Required	Description
`blockId`	String	-	RAG processing step `START` \| `QUERY_REWRITE` \| `RETRIEVAL` \| `CONDITIONAL` \| `GENERATOR` `START`: User query entry `QUERY_REWRITE`: Query rewrite `RETRIEVAL`: Document search `CONDITIONAL`: Branch selection `GENERATOR`: Final answer generation
`value`	String \| Array	-	Detailed result by block `START`: User query `QUERY_REWRITE`: Rewritten query `RETRIEVAL`: Document search result `CONDITIONAL`: Branch result `GENERATOR`: LLM final answer
`metadata`	Object	-	Metadata by RAG processing step
`metadata.finishReason`	String	-	Reason for stopping token generation (only displayed on `QUERY_REWRITE` and `GENERATOR` blocks) `length` \| `stop` \| `tool_calls` `length`: Length limit `stop`: Character specified in stop occurred during answer generation. `tool_calls`: Model successfully completed tool call.
`metadata.chatModel`	String	-	LLM model (only displayed on `QUERY_REWRITE` and `GENERATOR` blocks) `HyperClovaX` `OpenAI` `Anthropic`
`metadata.modelName`	String	-	LLM model name `HyperClovaX`: `HCX-003` \| `HCX-005` \| `HCX-007` \| `HCX-DASH-001` \| `HCX-DASH-002` \| ... `OpenAI`: `gpt-5` \| `gpt-5-mini` \| ... `Anthropic`: `claude-sonnet-4-5` \| `claude-haiku-4-5` \| `claude-opus-4-1` \| ...
`metadata.inputTokens`	Integer	-	Input token count (only displayed on `QUERY_REWRITE` and `GENERATOR` blocks)
`metadata.outputTokens`	Integer	-	Output token count (only displayed on `QUERY_REWRITE` and `GENERATOR` blocks)
`metadata.totalTokens`	Integer	-	Total token count (only displayed on `QUERY_REWRITE` and `GENERATOR` blocks)
`metadata.thinkingContent`	String	-	Inference output result (only displayed on `GENERATOR` blocks when using `HCX-007`)

Document search result

Field	Type	Required	Description
`docId`	String	-	Document's unique identifier
`fileName`	String	-	File name to search for
`score`	Float	-	Search similarity score 0-1 Higher values indicate higher similarity.
`rank`	Integer	-	Search result rank
`text`	String	-	Text searched from the document
`start_offset`	Integer	-	Character position in the document where the text chunk starts
`end_offset`	Integer	-	Character position in the document where the text chunk ends
`chunk_no`	Integer	-	Chunk number
`page_no`	Integer	-	Page number

`data.references`

This is the citation information used to generate the answer.

Field	Type	Required	Description
`name`	String	-	Block name `RETRIEVAL`
`data`	Array	-	Citation details

data

Field	Type	Required	Description
`docId`	String	-	Document's unique identifier
`fileName`	String	-	Citation target file name
`text`	String	-	Text cited from the document
`start_offset`	Integer	-	Character position in the document where the text chunk starts
`end_offset`	Integer	-	Character position in the document where the text chunk ends
`chunk_no`	Integer	-	Chunk number
`page_no`	Integer	-	Page number
`extra`	Object	-	Additional Information
`extra.bucket`	String	-	Bucket name
`extra.size`	Integer	-	File size (byte)
`extra.last_modified`	String	-	Last modification date and time ISO 8601 format
`extra.key`	String	-	File name

Response example

The response example is as follows:

{
    "code": 200,
    "message": null,
    "data": {
        "results": [
            {
                "blockId": "START",
                "value": "What do people call this technology?"
            },
            {
                "blockId": "QUERY_REWRITE",
                "value": "What is the name people use for NAVER's technology?",
                "metadata": {
                    "finishReason": "length",
                    "chatModel": "HyperClovaX",
                    "modelName": "HCX-007",
                    "inputTokens": 1355,
                    "outputTokens": 256,
                    "totalTokens": 1611
                }
            },
            {
                "blockId": "RETRIEVAL",
                "value": [
                    {
                        "docId": "c69cd98c0674e1a8a814",
                        "fileName": "leap-25.docx",
                        "score": 0.04168701171875,
                        "rank": 1,
                        "text": "NAVER has a unique technology that can quickly and accurately create digital twin data for entire megacities. Using aerial photography and photogrammetry, NAVER can quickly and accurately create megacity-scale digital twin data, which is then restored to life-like three-dimensional space using AI technology. In Korea, NAVER already offers a VR service through NAVER Pay that allows users to view real estate listings without having to visit the actual property.",
                        "start_offset": 4025,
                        "end_offset": 4258,
                        "chunk_no": 1,
                        "page_no": 0
                    }
                ]
            },
            {
                "blockId": "CONDITIONAL",
                "value": "branch1"
            },
            {
                "blockId": "GENERATOR",
                "value": "This technology that NAVER has is called Digital Twin. A digital twin is a technology that creates an exact replica of a physical object in the real world in a virtual world so that it can be monitored and controlled in real time. In other words, it is like a twin of a real space in a computer.\n\nFor example, if you create a digital twin of the neighborhood or school building where you live, you can look around the building on your computer screen without actually being there, walk around the streets and check out the surrounding facilities. It can also be used to predict future situations or solve problems by simulating weather changes or traffic conditions.\n\nNAVER is developing new services in various fields using this digital twin technology. For example, a virtual fitting room that lets you try on clothes while shopping online, or a feature that lets you explore the interior of your home in 3D and decorate it.\n\nDigital twin technology is a wonderful tool that makes our daily lives easier and sparks our imagination. As it continues to evolve, we can expect to see it used in even more ways.",
                "metadata": {
                    "finishReason": "length",
                    "chatModel": "HyperClovaX",
                    "modelName": "HCX-007",
                    "inputTokens": 1355,
                    "outputTokens": 256,
                    "totalTokens": 1611,
                    "thinkingContent": "The user is asking about the concept of digital twin technology and its application at NAVER, so I'll explain it step by step, starting with the definition of the technology itself and ending with real-world examples."
                }
            }
        ],
        "references": [
            {
                "name": "RETRIEVAL",
                "data": [
                    {
                        "docId": "c69cd98c0674e1a8a814",
                        "fileName": "leap-25.docx",
                        "text": "NAVER has a unique technology that can quickly and accurately create digital twin data for entire megacities. Using aerial photography and photogrammetry, NAVER can quickly and accurately create megacity-scale digital twin data, which is then restored to life-like three-dimensional space using AI technology. In Korea, NAVER already offers a VR service through NAVER Pay that allows users to view real estate listings without having to visit the actual property.",
                        "startOffset": 4025,
                        "endOffset": 4258,
                        "chunkNo": 1,
                        "pageNo": 0,
                        "extra": {
                            "bucket": "clova-ai",
                            "size": 966060,
                            "last_modified": "2025-07-02T10:17:19+00:00",
                            "key": "leap-25.docx"
                        }
                    }
                ]
            }
        ],
        "rateLimit": {
            "limitRequests": null,
            "remainingRequests": null,
            "resetRequests": null,
            "limitTokens": "60000",
            "remainingTokens": "58324",
            "resetTokens": "59s"
        }
    }
}

Response stream

Server-Sent Events (SSE) provides real-time streaming responses. Each SSE message consists of the following format. (Events are separated by blank lines.)

event: [event_type]
data: [json_data]

The Create conversation API supports five event types: step, ping, token, result, and error.

step: Output of block-by-block processing results before generating answers (if verbose = true).
ping: Output data to prevent read timeouts while waiting before generating the answer (every second until the answer is generated).
token: Output the token of the answer being generated in real time.
result: Output the final answer.
error: Output when an error occurs during answer generation.

Response headers

The following describes the response headers.

Headers	Required	Description
`Accept`	-	Response data format `text/event-stream`

Response body

The response body differs depending on the event type.

`step` event

The following describes the step events.

Field	Type	Required	Description
`blockName`	String	-	RAG processing step `START` \| `QUERY_REWRITE` \| `RETRIEVAL` \| `CONDITIONAL` `START`: User query entry `QUERY_REWRITE`: Query rewrite `RETRIEVAL`: Document search `CONDITIONAL`: Branch selection
`blockResult`	String \| Array	-	Detailed result by block `START`: User query `QUERY_REWRITE`: Rewritten query `RETRIEVAL`: Document search result `CONDITIONAL`: Branch result

Document search result

Field	Type	Required	Description
`docId`	String	-	Document's unique identifier
`fileName`	String	-	File name to search for
`score`	Float	-	Search similarity score 0-1 Higher values indicate higher similarity.
`rank`	Integer	-	Search result rank
`text`	String	-	Text searched from the document
`start_offset`	Integer	-	Character position in the document where the text chunk starts
`end_offset`	Integer	-	Character position in the document where the text chunk ends
`chunk_no`	Integer	-	Chunk number
`page_no`	Integer	-	Page number

`ping`, `token`, `result` events

The following describes the ping, token, and result events.

Field	Type	Required	Description
`message`	Object	-	Message information
`message.role`	String	-	Message role `assistant`
`message.content`	String	-	Currently generated token In `result` event, all generated tokens
`message.thinkingContent`	String	-	Currently generated inference token In `token` event, no output once inference is done In `result` event, all generated tokens
`status`	String	-	Message status `IN_PROGRESS` \| `COMPLETE` `IN_PROGRESS`: Creating `COMPLETE`: Creation completed
`references`	Array	-	Citation information (only included in `result` events)
`rateLimit`	Object	-	API usage limit information (only included in `result` events)
`rateLimit.limitRequests`	String	-	Request count limit
`rateLimit.remainingRequests`	String	-	Remaining number of requests
`rateLimit.resetRequests`	String	-	Request limit reset time (second)
`rateLimit.limitTokens`	String	-	Token count limit
`rateLimit.remainingTokens`	String	-	Remaining number of tokens
`rateLimit.resetTokens`	String	-	Token limit reset time (second)
`metadata`	String	-	Requested metadata (only included in `result` events)
`metadata.finishReason`	String	-	Reason for stopping token generation `length` \| `stop` \| `tool_calls` `length`: Length limit `stop`: Character specified in stop occurred during answer generation. `tool_calls`: Model successfully completed tool call.
`metadata.chatModel`	String	-	LLM model `HyperClovaX` `OpenAI` `Anthropic`
`metadata.modelName`	String	-	LLM model name `HyperClovaX`: `HCX-003` \| `HCX-005` \| `HCX-007` \| `HCX-DASH-001` \| `HCX-DASH-002` \| ... `OpenAI`: `gpt-5` \| `gpt-5-mini` \| ... `Anthropic`: `claude-sonnet-4-5` \| `claude-haiku-4-5` \| `claude-opus-4-1` \| ...
`metadata.inputTokens`	Integer	-	Input token count
`metadata.outputTokens`	Integer	-	Output token count
`metadata.totalTokens`	Integer	-	Total token count

references

Field	Type	Required	Description
`name`	String	-	Block name `RETRIEVAL`
`data`	Array	-	Citation details

data

Field	Type	Required	Description
`docId`	String	-	Document's unique identifier
`fileName`	String	-	Citation target file name
`text`	String	-	Text cited from the document
`start_offset`	Integer	-	Character position in the document where the text chunk starts
`end_offset`	Integer	-	Character position in the document where the text chunk ends
`chunk_no`	Integer	-	Chunk number
`page_no`	Integer	-	Page number
`extra`	Object	-	Additional Information
`extra.bucket`	String	-	Bucket name
`extra.size`	Integer	-	File size (byte)
`extra.last_modified`	String	-	Last modification date and time ISO 8601 format
`extra.key`	String	-	Object Storage key

`error` event

The following describes the error events.

Field	Type	Required	Description
`code`	Integer	Required	API response status code
`message`	String	Required	API response status message

Response status codes

For information about the HTTP status codes common to all RAG APIs, see RAG response status codes.

Response example

The response example is as follows:

event: step
data: {
    "blockName": "START",
    "blockResult": "What do people call this technology?"
}

event: step
data: {
    "blockName": "QUERY_REWRITE",
    "blockResult": "What is the name people use for NAVER's technology?"
}

event: step
data: {
    "blockName": "RETRIEVAL",
    "blockResult": [
        {
            "docId": "{docId}",
            "fileName": "leap-25.docx",
            "score": 0.04168701171875,
            "rank": 1,
            "text": "NAVER has a unique technology that can quickly and accurately create digital twin data for entire megacities. Using aerial photography and photogrammetry, NAVER can quickly and accurately create megacity-scale digital twin data, which is then restored to life-like three-dimensional space using AI technology. In Korea, NAVER already offers a VR service through NAVER Pay that allows users to view real estate listings without having to visit the actual property.",
            "start_offset": 4025,
            "end_offset": 4258,
            "chunk_no": 1,
            "page_no": 0
        }
    ]
}

event: step
data: {
    "blockName": "CONDITIONAL",
    "blockResult": "branch1"
}

event: ping
data: {
    "message": {
        "role": "assistant",
        "content": ""
    },
    "status": "IN_PROGRESS",
    "references": [],
    "rateLimit": null,
    "metadata": {}
}

...

event: token
data: {
    "message": {
        "role": "assistant",
        "content": "",
        "thinkingContent": "By user"
    },
    "status": "IN_PROGRESS",
    "references": [],
    "rateLimit": null,
    "metadata": {}
}

...

event: token
data: {
    "message": {
        "role": "assistant",
        "content": " Digital"
    },
    "status": "IN_PROGRESS",
    "references": [],
    "rateLimit": null,
    "metadata": {}
}

...

event: result
data: {
    "message": {
        "role": "assistant",
        "content": "This technology that NAVER has is called Digital Twin.[†1] A digital twin is an exact replica of a physical object in the real world in the virtual world, which allows us to perform various simulations and analyses to help us make decisions.\n\nFor example, if we draw a house on a piece of paper, we draw a picture that looks exactly like the house we will build. Then we can plan and experiment with different things on it, like if we want to know how water flows when it rains, we can make it rain on the drawing and observe the results.\n\nThis virtual world can help us save time and money by allowing us to test different situations before we build a real building or city. It can also help us manage real buildings. For example, we can monitor whether the machines in the building are working properly or using energy efficiently, so we can take action before something goes wrong.\n\nNAVER is utilizing this digital twin technology to recreate various regions of Korea in the virtual world and develop various services based on them.[†1] This means that we will be able to see and experience information about the region on our computers or smartphones without having to go there. It's really exciting, right?",
        "thinkingContent": "The user is asking about the concept of digital twin technology and its application at NAVER, so I'll explain it step by step, starting with the definition of the technology itself and ending with real-world examples."
    },
    "status": "COMPLETE",
    "references": [
        {
            "name": "RETRIEVAL",
            "data": [
                {
                    "docId": "12345",
                    "fileName": "leap-25.docx",
                    "text": "NAVER has a unique technology that can quickly and accurately create digital twin data for entire megacities. Using aerial photography and photogrammetry, NAVER can quickly and accurately create megacity-scale digital twin data, which is then restored to life-like three-dimensional space using AI technology. In Korea, NAVER already offers a VR service through NAVER Pay that allows users to view real estate listings without having to visit the actual property.",
                    "startOffset": 4025,
                    "endOffset": 4258,
                    "chunkNo": 1,
                    "pageNo": 0,
                    "extra": {
                        "bucket": "clova-ai",
                        "size": 966060,
                        "last_modified": "2025-07-02T10:17:19+00:00",
                        "key": "leap-25.docx"
                    }
                }
            ]
        }
    ],
    "rateLimit": {
        "limitRequests": null,
        "remainingRequests": null,
        "resetRequests": null,
        "limitTokens": "60000",
        "remainingTokens": "56648",
        "resetTokens": "28s"
    },
    "metadata": {
        "finishReason": "stop",
        "chatModel": "HyperClovaX",
        "modelName": "HCX-005",
        "inputTokens": 267,
        "outputTokens": 262,
        "totalTokens": 529
    }
}

Create conversation

Request

Request headers

Request path parameters

Request body

messages

RETRIEVAL parameter

GENERATOR parameter

Request example

Response

Response headers

Response body

data.results

data.references

Response example

Response stream

Response headers

Response body

step event

ping, token, result events

error event

Response status codes

Response example

`messages`

`data.results`

`data.references`

`step` event

`ping`, `token`, `result` events

`error` event