TTS (Premium)

    Available in Classic and VPC

    Synthesizes speech from the input text, using parameters such as tone, speed, and emotion.

    Request

    The following describes the request format for the endpoint.

    Method | URI
    POST | /tts

    Request headers

    For headers common to all CLOVA Voice APIs, see Common CLOVA Voice headers.

    Request body

    The following describes the request body.

    Field | Type | Required | Description
    speaker | String | Required | Voice type to use for speech synthesis
    text | String | Required | Text to be converted to speech
    • Only UTF-8-encoded text is supported
    • Up to 2000 characters can be synthesized
    • Text in symbols or parentheses is not converted
    volume | Integer | Optional | Speech volume
    • -5 to 5 (default: 0)
      • -5: 0.5x volume (quieter)
      • 0: normal volume
      • 5: 1.5x volume (louder)
    speed | Integer | Optional | Speech speed
    • -5 to 5 (default: 0)
      • -5: 2x speed (faster)
      • 0: normal speed
      • 5: 0.5x speed (slower)
    pitch | Integer | Optional | Speech pitch
    • -5 to 5 (default: 0)
      • -5: 1.2x pitch (higher)
      • 0: normal pitch
      • 5: 0.8x pitch (lower)
    emotion | Integer | Optional | Emotion level of speech
    • Supported voices: nara | vara | vmikyung | vdain | vyuna | vgoeun | vdaeseong
    • 0 to 3 (default: 0)
      • 0: neutral
      • 1: sad
      • 2: happy
      • 3: angry (not supported for nara)
    emotion-strength | Integer | Optional | Emotion intensity of speech
    • Supported voices: vara | vmikyung | vdain | vyuna | vgoeun | vdaeseong
    • 0 to 2 (default: 1)
      • 0: weak
      • 1: normal
      • 2: strong
    format | String | Optional | Speech file format
    • mp3 (default) | wav
    sampling-rate | Integer | Optional | Sampling rate of speech
    • Only supported for wav format
    • 8000 | 16000 | 24000 (default) | 48000
      • Exception: mijin also supports the 16000 rate for mp3 format
    alpha | Integer | Optional | Tone
    • -5 to 5 (default: 0)
      • Higher than 0: high tone
      • Lower than 0: low tone
    end-pitch | Integer | Optional | End-pitch processing of speech
    • Supported voices: clara | matt | meimei | liangliang | chiahua | kuanlin | carmen | jose | all voices starting with d- (e.g., dara)
    • -5 to 5 (default: 0)
      • Higher than 0: high end-pitch
      • Lower than 0: low end-pitch
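
    The ranges and allowed values in the table can be checked on the client before a request is sent. The following is a minimal Python sketch, not part of the CLOVA Voice API: build_tts_body and the constant names are hypothetical, and the 2000-character check assumes the limit is counted the same way as Python's len().

```python
from urllib.parse import urlencode

# Integer ranges taken from the request body table: field -> (min, max).
INT_RANGES = {
    "volume": (-5, 5),
    "speed": (-5, 5),
    "pitch": (-5, 5),
    "emotion": (0, 3),
    "emotion-strength": (0, 2),
    "alpha": (-5, 5),
    "end-pitch": (-5, 5),
}
FORMATS = {"mp3", "wav"}
SAMPLING_RATES = {8000, 16000, 24000, 48000}
MAX_TEXT_LENGTH = 2000  # assumption: counted like len() on a Python str


def build_tts_body(speaker, text, **options):
    """Hypothetical helper: validate fields and return a form-encoded body."""
    if not speaker:
        raise ValueError("speaker is required")
    if not text:
        raise ValueError("text is required")
    if len(text) > MAX_TEXT_LENGTH:
        raise ValueError("text exceeds 2000 characters")

    fields = {"speaker": speaker, "text": text}
    for name, value in options.items():
        # Accept Python-style keyword names such as emotion_strength.
        key = name.replace("_", "-")
        if key in INT_RANGES:
            low, high = INT_RANGES[key]
            if not isinstance(value, int) or not low <= value <= high:
                raise ValueError(f"{key} must be an integer from {low} to {high}")
        elif key == "format":
            if value not in FORMATS:
                raise ValueError("format must be mp3 or wav")
        elif key == "sampling-rate":
            if value not in SAMPLING_RATES:
                raise ValueError("unsupported sampling-rate")
        else:
            raise ValueError(f"unknown field: {key}")
        fields[key] = value

    # sampling-rate is documented for wav only; mijin with 16000 and mp3
    # is the one listed exception.
    if "sampling-rate" in fields and fields.get("format", "mp3") != "wav":
        if not (speaker == "mijin" and fields["sampling-rate"] == 16000):
            raise ValueError("sampling-rate requires format=wav")

    # urlencode percent-encodes the values as UTF-8 by default.
    return urlencode(fields)
```

    For example, build_tts_body("nara", "Hello", speed=-1, format="wav", sampling_rate=8000) returns speaker=nara&text=Hello&speed=-1&format=wav&sampling-rate=8000, while volume=6 raises ValueError.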

    List of speech synthesis voices

    The following lists the voices available for speech synthesis. Use the Field value as the speaker parameter.

    Field | Name | Language | Description
    nara | Ara | Korean | Female
    nara_call | Ara (agent) | Korean | Female
    nminyoung | Minyoung | Korean | Female
    nyejin | Yejin | Korean | Female
    mijin | Mijin | Korean | Female
    jinho | Jinho | Korean | Male
    clara | Clara | English | Female
    matt | Matt | English | Male
    shinji | Shinji | Japanese | Male
    meimei | Meimei | Chinese | Female
    liangliang | Liangliang | Chinese | Male
    jose | Jose | Spanish | Male
    carmen | Carmen | Spanish | Female
    nminsang | Minsang | Korean | Male
    nsinu | Sinu | Korean | Male
    nhajun | Hajun | Korean | Child (male)
    ndain | Dain | Korean | Child (female)
    njiyun | Jiyun | Korean | Female
    nsujin | Sujin | Korean | Female
    njinho | Jinho | Korean | Male
    njihun | Jihun | Korean | Male
    njooahn | Jooahn | Korean | Male
    nseonghoon | Seonghoon | Korean | Male
    njihwan | Jihwan | Korean | Male
    nsiyoon | Siyoon | Korean | Male
    ngaram | Garam | Korean | Child (female)
    ntomoko | Tomoko | Japanese | Female
    nnaomi | Naomi | Japanese | Female
    dnaomi_joyful | Naomi (happy) | Japanese | Female
    dnaomi_formal | Naomi (news) | Japanese | Female
    driko | Riko | Japanese | Female
    deriko | Eriko | Japanese | Female
    nsayuri | Sayuri | Japanese | Female
    ngoeun | Goeun | Korean | Female
    neunyoung | Eunyoung | Korean | Female
    nsunkyung | Sunkyung | Korean | Female
    nyujin | Yujin | Korean | Female
    ntaejin | Taejin | Korean | Male
    nyoungil | Youngil | Korean | Male
    nseungpyo | Seungpyo | Korean | Male
    nwontak | Wontak | Korean | Male
    dara_ang | Ara (angry) | Korean | Female
    nsunhee | Sunhee | Korean | Female
    nminseo | Minseo | Korean | Female
    njiwon | Jiwon | Korean | Female
    nbora | Bora | Korean | Female
    njonghyun | Jonghyun | Korean | Male
    njoonyoung | Joonyoung | Korean | Male
    njaewook | Jaewook | Korean | Male
    danna | Anna | English | Female
    djoey | Joey | English | Female
    dhajime | Hajime | Japanese | Male
    ddaiki | Daiki | Japanese | Male
    dayumu | Ayumu | Japanese | Male
    dmio | Mio | Japanese | Female
    chiahua | Chiahua | Taiwanese | Female
    kuanlin | Kuanlin | Taiwanese | Male
    nes_c_hyeri | Hyeri | Korean | Female
    nes_c_sohyun | Sohyun | Korean | Female
    nes_c_mikyung | Mikyung | Korean | Female
    nes_c_kihyo | Kihyo | Korean | Male
    ntiffany | Kiseo | Korean | Female
    napple | Neulbom | Korean | Female
    njangj | Dream | Korean | Female
    noyj | Bomdal | Korean | Female
    neunseo | Eunseo | Korean | Female
    nheera | Heera | Korean | Female
    nyoungmi | Youngmi | Korean | Female
    nnarae | Narae | Korean | Female
    nyeji | Yeji | Korean | Female
    nyuna | Yuna | Korean | Female
    nkyunglee | Kyunglee | Korean | Female
    nminjeong | Minjeong | Korean | Female
    nihyun | Ihyun | Korean | Female
    nraewon | Raewon | Korean | Male
    nkyuwon | Kyuwon | Korean | Male
    nkitae | Kitae | Korean | Male
    neunwoo | Eunwoo | Korean | Male
    nkyungtae | Kyungtae | Korean | Male
    nwoosik | Woosik | Korean | Male
    vara | Ara (Pro) | Korean | Female
    vmikyung | Mikyung (Pro) | Korean | Female
    vdain | Dain (Pro) | Korean | Female
    vyuna | Yuna (Pro) | Korean | Female
    vhyeri | Hyeri (Pro) | Korean | Female
    dara-danna | Ara & Anna | Korean + English (U.S.) | Female
    dsinu-matt | Sinu & Matt | Korean + English (U.S.) | Male
    nsabina | Witch Sabina | Korean | Female
    nmammon | Demon Mammon | Korean | Male
    nmeow | Meow | Korean | Child (female)
    nwoof | Woof | Korean | Child (male)
    nreview | Review Park | Korean | Male
    nyounghwa | Movie Jeong | Korean | Female
    nmovie | Movie Choi | Korean | Male
    nsangdo | Sangdo | Korean | Male
    nshasha | Shasha | Korean | Female
    nian | Ian | Korean | Male
    ndonghyun | Donghyun | Korean | Male
    vian | Ian (Pro) | Korean | Male
    vdonghyun | Donghyun (Pro) | Korean | Male
    dsayuri | Sayuri | Japanese | Female
    dtomoko | Tomoko | Japanese | Female
    dnaomi | Naomi | Japanese | Female
    vgoeun | Goeun (Pro) | Korean | Female
    vdaeseong | Daeseong (Pro) | Korean | Male
    ngyeongjun | Gyeongjun | Korean | Male
    ndaeseong | Daeseong | Korean | Male
    njonghyeok | Jonghyeok | Korean | Male
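
    The emotion, emotion-strength, and end-pitch fields in the request body apply only to certain voices. The sketch below shows one way a client might flag fields that the chosen voice is not listed as supporting. unsupported_fields is a hypothetical helper, and reading "voices starting with d-" as "voice codes that begin with d" is an assumption. This page does not state how the service handles such fields, so the helper only reports them.

```python
# Voice lists copied from the request body table.
EMOTION_VOICES = {"nara", "vara", "vmikyung", "vdain", "vyuna", "vgoeun", "vdaeseong"}
EMOTION_STRENGTH_VOICES = EMOTION_VOICES - {"nara"}
END_PITCH_VOICES = {
    "clara", "matt", "meimei", "liangliang",
    "chiahua", "kuanlin", "carmen", "jose",
}


def unsupported_fields(speaker, fields):
    """Hypothetical helper: list fields the speaker is not documented to support."""
    problems = []
    if "emotion" in fields:
        if speaker not in EMOTION_VOICES:
            problems.append("emotion")
        elif speaker == "nara" and fields["emotion"] == 3:
            # The table notes that angry (3) is not supported for nara.
            problems.append("emotion")
    if "emotion-strength" in fields and speaker not in EMOTION_STRENGTH_VOICES:
        problems.append("emotion-strength")
    if "end-pitch" in fields:
        # Assumption: "starting with d-" means the voice code begins with "d".
        if speaker not in END_PITCH_VOICES and not speaker.startswith("d"):
            problems.append("end-pitch")
    return problems
```

    For example, unsupported_fields("nara", {"emotion": 3}) returns ["emotion"], and unsupported_fields("danna", {"end-pitch": 1}) returns an empty list.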

    Request example

    The following is a sample request.

    curl --location --request POST 'https://naveropenapi.apigw.ntruss.com/tts-premium/v1/tts' \
    --header 'X-NCP-APIGW-API-KEY-ID: {Client ID issued when registering the app}' \
    --header 'X-NCP-APIGW-API-KEY: {Client secret issued when registering the app}' \
    --header 'Content-Type: application/x-www-form-urlencoded' \
    --data-urlencode 'speaker=nara' \
    --data-urlencode 'text=Hello' \
    --data-urlencode 'volume=0' \
    --data-urlencode 'speed=-1' \
    --data-urlencode 'pitch=1' \
    --data-urlencode 'emotion=2' \
    --data-urlencode 'emotion-strength=1' \
    --data-urlencode 'format=wav' \
    --data-urlencode 'sampling-rate=8000' \
    --data-urlencode 'alpha=0' \
    --data-urlencode 'end-pitch=0'
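
    The same request can be sent from code. The following Python sketch mirrors the curl sample using only the standard library. The URL and header names come from the sample, while make_tts_request, synthesize, and the out_path argument are hypothetical names chosen for illustration. Error handling for non-success status codes is omitted.

```python
import urllib.request
from urllib.parse import urlencode

API_URL = "https://naveropenapi.apigw.ntruss.com/tts-premium/v1/tts"


def make_tts_request(client_id, client_secret, fields):
    """Build (but do not send) a POST request equivalent to the curl sample."""
    # Form-encode the fields as UTF-8, matching --data-urlencode.
    data = urlencode(fields).encode("utf-8")
    return urllib.request.Request(
        API_URL,
        data=data,
        method="POST",
        headers={
            "X-NCP-APIGW-API-KEY-ID": client_id,
            "X-NCP-APIGW-API-KEY": client_secret,
            "Content-Type": "application/x-www-form-urlencoded",
        },
    )


def synthesize(client_id, client_secret, fields, out_path):
    """Send the request and write the binary audio response to out_path."""
    request = make_tts_request(client_id, client_secret, fields)
    # urlopen raises urllib.error.HTTPError for 4xx/5xx responses.
    with urllib.request.urlopen(request) as response:
        audio = response.read()
    with open(out_path, "wb") as f:
        f.write(audio)
    return len(audio)
```

    A call such as synthesize(client_id, client_secret, {"speaker": "nara", "text": "Hello", "format": "wav"}, "hello.wav") would write the returned audio to hello.wav, given valid credentials.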
    

    Response

    The following describes the response format.

    Response body

    The following describes the response body.

    Field | Type | Required | Description
    Responded TTS audio | Binary | - | Binary voice data in MP3 or WAV format

    Response status codes

    For response status codes common to all CLOVA Voice APIs, see Common CLOVA Voice response status codes.

    Response example

    The following is a sample response.

    {Binary voice data in MP3 or WAV format}
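
    Because the body is raw audio rather than JSON, a client may want to confirm what it received before saving it. The sketch below guesses the container from the leading bytes, using the standard RIFF/WAVE signature and the usual MP3 markers (an ID3v2 tag or an MPEG frame sync). detect_audio_format is a hypothetical helper, and whether the service prepends an ID3 tag to MP3 output is not stated on this page.

```python
def detect_audio_format(data):
    """Guess "wav" or "mp3" from the first bytes of a response body."""
    # WAV files are RIFF containers: b"RIFF", a 4-byte size, then b"WAVE".
    if len(data) >= 12 and data[:4] == b"RIFF" and data[8:12] == b"WAVE":
        return "wav"
    # MP3 data may start with an ID3v2 tag...
    if data[:3] == b"ID3":
        return "mp3"
    # ...or directly with an MPEG audio frame, whose first 11 bits are set.
    if len(data) >= 2 and data[0] == 0xFF and (data[1] & 0xE0) == 0xE0:
        return "mp3"
    # Anything else (for example, an error payload) is left unclassified.
    return None
```

    The result can be used to pick a file extension, or to detect that the body is not audio at all when None is returned.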
    
