Repository Mirror for your Cloud Server and Webhosting

Title:

Call Google's 'Natural Language', 'Cloud Translation', 'Cloud Speech', and 'Cloud Text-to-Speech' APIs

Version:

0.3.1.1

Description:

Access Google Cloud machine learning APIs for text and speech tasks. Use the Cloud Translation API for text detection and translation, the Natural Language API to analyze sentiment, entities, and syntax, the Cloud Speech API to transcribe audio to text, and the Cloud Text-to-Speech API to synthesize text into audio files.

URL:

https://github.com/ropensci/googleLanguageR

BugReports:

https://github.com/ropensci/googleLanguageR/issues

Depends:

R (≥ 3.3)

License:

MIT + file LICENSE

Encoding:

UTF-8

RoxygenNote:

7.3.3

VignetteBuilder:

knitr

Imports:

assertthat, base64enc, googleAuthR (≥ 1.1.1), jsonlite, purrr (≥ 0.2.4), stats, tibble, utils

Suggests:

pdftools, cld2, testthat, knitr, rmarkdown, rvest, shiny, shinyjs, stringdist, tidyr, tuneR, xml2, av, magrittr

NeedsCompilation:

Packaged:

2025-10-22 10:15:18 UTC; Isabella

Author:

Aleksander Dietrichson [ctb], Mark Edmondson [aut], John Muschelli [ctb], Neal Richardson [rev] (Reviewed package for ropensci), Julia Gustavsen [rev] (Reviewed package for ropensci), Cheryl Isabella Lim [aut, cre]

Maintainer:

Cheryl Isabella Lim <cheryl.academic@gmail.com>

Repository:

CRAN

Date/Publication:

2025-10-22 11:00:09 UTC

googleLanguageR

Description

This package contains functions for analysing language through the Google Cloud Machine Learning APIs

Details

For examples and documentation see the vignettes and the website:

https://github.com/ropensci/googleLanguageR

Author(s)

Maintainer: Cheryl Isabella cheryl.academic@gmail.com

Authors:

Mark Edmondson r@sunholo.com

Other contributors:

Aleksander Dietrichson dietrichson@gmail.com [contributor]
John Muschelli muschellij2@gmail.com [contributor]
Neal Richardson neal.p.richardson@gmail.com (Neal reviewed the package for ropensci) [reviewer]
Julia Gustavsen j.gustavsen@gmail.com (Julia reviewed the package for ropensci) [reviewer]

Determine API endpoint version

Description

Used to specify the API endpoint: GitHub beta versions or CRAN stable.

Usage

get_version(api = c("nlp", "tts", "speech", "trans"))

Arguments

api

Character, one of "nlp", "tts", "speech", "trans"

Value

Character string of API version

Authenticate with Google Language API services

Description

Authenticate with Google Language API services

Usage

gl_auth(json_file)

gl_auto_auth(...)

Arguments

json_file

Character. Path to the JSON authentication file downloaded from your Google Cloud project.

...

Additional arguments passed to gar_attach_auto_auth.

Details

This function authenticates with Google Cloud's language APIs. By default, it uses the JSON file specified in json_file. Alternatively, you can set the file path in the environment variable GL_AUTH to auto-authenticate when loading the package.

Examples

## Not run: 
library(googleLanguageR)
gl_auth("path/to/json_file.json")

## End(Not run)
## Not run: 
library(googleLanguageR)
gl_auto_auth()
gl_auto_auth(environment_var = "GAR_AUTH_FILE")

## End(Not run)

Perform Natural Language Analysis

Description

Analyse text for entities, sentiment, syntax and classification using the Google Natural Language API.

Usage

gl_nlp(
  string,
  nlp_type = c("annotateText", "analyzeEntities", "analyzeSentiment", "analyzeSyntax",
    "analyzeEntitySentiment", "classifyText"),
  type = c("PLAIN_TEXT", "HTML"),
  language = c("en", "zh", "zh-Hant", "fr", "de", "it", "ja", "ko", "pt", "es"),
  encodingType = c("UTF8", "UTF16", "UTF32", "NONE")
)

Arguments

string

Character vector. Text to analyse or Google Cloud Storage URI(s) in the form gs://bucket_name/object_name.

nlp_type

Character. Type of analysis to perform. Default annotateText performs all features in a single call. Options include: analyzeEntities, analyzeSentiment, analyzeSyntax, analyzeEntitySentiment, classifyText.

type

Character. Whether the input is plain text (PLAIN_TEXT) or HTML (HTML).

language

Character. Language of the source text. Must be supported by the API.

encodingType

Character. Text encoding used to process the output. Default UTF8.

Details

Encoding type can usually be left at the default UTF8. Further details on encoding types.

Current language support is listed here.

Value

A list containing the requested components as specified by nlp_type:

sentences

Sentences in the input document. API reference.

tokens

Tokens with syntactic information. API reference.

entities

Entities with semantic information. API reference.

documentSentiment

Overall sentiment of the document. API reference.

classifyText

Document classification. API reference.

language

Detected language of the text, or the language specified in the request.

text

Original text passed to the API. Returns NA if input is empty.

Examples

## Not run: 
library(googleLanguageR)

text <- "To administer medicine to animals is frequently difficult, yet sometimes necessary."
nlp <- gl_nlp(text)

nlp$sentences
nlp$tokens
nlp$entities
nlp$documentSentiment

# Vectorised input
texts <- c("The cat sat on the mat.", "Oh no, it did not, you fool!")
nlp_results <- gl_nlp(texts)

## End(Not run)

Call Google Speech API

Description

Turn audio into text

Usage

gl_speech(
  audio_source,
  encoding = c("LINEAR16", "FLAC", "MULAW", "AMR", "AMR_WB", "OGG_OPUS",
    "SPEEX_WITH_HEADER_BYTE"),
  sampleRateHertz = NULL,
  languageCode = "en-US",
  maxAlternatives = 1L,
  profanityFilter = FALSE,
  speechContexts = NULL,
  asynch = FALSE,
  customConfig = NULL
)

Arguments

audio_source

File location of audio data, or Google Cloud Storage URI

encoding

Encoding of audio data sent

sampleRateHertz

Sample rate in Hertz of audio data. Valid values 8000-48000. Optimal and default if left NULL is 16000

languageCode

Language of the supplied audio as a BCP-47 language tag

maxAlternatives

Maximum number of recognition hypotheses to be returned. 0-30

profanityFilter

If TRUE will attempt to filter out profanities

speechContexts

An optional character vector of context to assist the speech recognition

asynch

If your audio_source is greater than 60 seconds, set this to TRUE to return an asynchronous call

customConfig

[optional] A RecognitionConfig object that will be converted from a list to JSON via toJSON - see RecognitionConfig documentation. The languageCode will be taken from this functions arguments if not present since it is required.

Details

Google Cloud Speech API enables developers to convert audio to text by applying powerful neural network models in an easy to use API. The API recognizes over 80 languages and variants, to support your global user base. You can transcribe the text of users dictating to an application’s microphone, enable command-and-control through voice, or transcribe audio files, among many other use cases. Recognize audio uploaded in the request, and integrate with your audio storage on Google Cloud Storage, by using the same technology Google uses to power its own products.

Value

A list of two tibbles: $transcript, a tibble of the transcript with a confidence; $timings, a tibble that contains startTime, endTime per word. If maxAlternatives is greater than 1, then the transcript will return near-duplicate rows with other interpretations of the text. If asynch is TRUE, then an operation you will need to pass to gl_speech_op to get the finished result.

AudioEncoding

Audio encoding of the data sent in the audio message. All encodings support only 1 channel (mono) audio. Only FLAC and WAV include a header that describes the bytes of audio that follow the header. The other encodings are raw audio bytes with no header. For best results, the audio source should be captured and transmitted using a lossless encoding (FLAC or LINEAR16). Recognition accuracy may be reduced if lossy codecs, which include the other codecs listed in this section, are used to capture or transmit the audio, particularly if background noise is present.

Read more on audio encodings here https://cloud.google.com/speech-to-text/docs/encoding

WordInfo

startTime - Time offset relative to the beginning of the audio, and corresponding to the start of the spoken word.

endTime - Time offset relative to the beginning of the audio, and corresponding to the end of the spoken word.

word - The word corresponding to this set of information.

Examples


## Not run: 

test_audio <- system.file("woman1_wb.wav", package = "googleLanguageR")
result <- gl_speech(test_audio)

result$transcript
result$timings

result2 <- gl_speech(test_audio, maxAlternatives = 2L)
result2$transcript

result_brit <- gl_speech(test_audio, languageCode = "en-GB")


## make an asynchronous API request (mandatory for sound files over 60 seconds)
asynch <- gl_speech(test_audio, asynch = TRUE)

## Send to gl_speech_op() for status or finished result
gl_speech_op(asynch)

## Upload to GCS bucket for long files > 60 seconds
test_gcs <- "gs://mark-edmondson-public-files/googleLanguageR/a-dream-mono.wav"
gcs <- gl_speech(test_gcs, sampleRateHertz = 44100L, asynch = TRUE)
gl_speech_op(gcs)

## Use a custom configuration
my_config <- list(encoding = "LINEAR16",
                  diarizationConfig = list(
                    enableSpeakerDiarization = TRUE,
                    minSpeakerCount = 2,
                    maxSpeakCount = 3
                    ))

# languageCode is required, so will be added if not in your custom config
gl_speech(my_audio, languageCode = "en-US", customConfig = my_config)


## End(Not run)

Get a speech operation

Description

For asynchronous calls of audio over 60 seconds, this returns the finished job

Usage

gl_speech_op(operation = .Last.value)

Arguments

operation

A speech operation object from gl_speech when asynch = TRUE

Value

If the operation is still running, another operation object. If done, the result as per gl_speech

Examples


## Not run: 

test_audio <- system.file("woman1_wb.wav", package = "googleLanguageR")

## make an asynchronous API request (mandatory for sound files over 60 seconds)
asynch <- gl_speech(test_audio, asynch = TRUE)

## Send to gl_speech_op() for status or finished result
gl_speech_op(asynch)


## End(Not run)

Perform text to speech

Description

Synthesizes speech synchronously: receive results after all text input has been processed.

Usage

gl_talk(
  input,
  output = "output.wav",
  languageCode = "en",
  gender = c("SSML_VOICE_GENDER_UNSPECIFIED", "MALE", "FEMALE", "NEUTRAL"),
  name = NULL,
  audioEncoding = c("LINEAR16", "MP3", "OGG_OPUS"),
  speakingRate = 1,
  pitch = 0,
  volumeGainDb = 0,
  sampleRateHertz = NULL,
  inputType = c("text", "ssml"),
  effectsProfileIds = NULL,
  forceLanguageCode = FALSE
)

Arguments

input

The text to turn into speech

output

Where to save the speech audio file

languageCode

The language of the voice as a BCP-47 language code

gender

The gender of the voice, if available

name

Name of the voice, see list via gl_talk_languages for supported voices. Set to NULL to make the service choose a voice based on languageCode and gender.

audioEncoding

Format of the requested audio stream

speakingRate

Speaking rate/speed between 0.25 and 4.0

pitch

Speaking pitch between -20.0 and 20.0 in semitones.

volumeGainDb

Volumne gain in dB

sampleRateHertz

Sample rate for returned audio

inputType

Choose between text (the default) or SSML markup. The input text must be SSML markup if you choose ssml

effectsProfileIds

Optional. An identifier which selects 'audio effects' profiles that are applied on (post synthesized) text to speech. Effects are applied on top of each other in the order they are given

forceLanguageCode

If name is provided, this will ensure that the passed languageCode is used instead of being inferred from name. This is necessary for models that require the exact code (en-us, en-gb, ...), not just the two letters shorthand (en, es, ...)

Details

Requires the Cloud Text-To-Speech API to be activated for your Google Cloud project.

Supported voices are here https://cloud.google.com/text-to-speech/docs/voices and can be imported into R via gl_talk_languages

To play the audio in code via a browser see gl_talk_player

To use Speech Synthesis Markup Language (SSML) select inputType=ssml - more details on using this to insert pauses, sounds and breaks in your audio can be found here: https://cloud.google.com/text-to-speech/docs/ssml

To use audio profiles, supply a character vector of the available audio profiles listed here: https://cloud.google.com/text-to-speech/docs/audio-profiles - the audio profiles are applied in the order given. For instance effectsProfileIds="wearable-class-device" will optimise output for smart watches, effectsProfileIds=c("wearable-class-device","telephony-class-application") will apply sound filters optimised for smart watches, then telephonic devices.

Value

The file output name you supplied as output

Examples


## Not run: 
library(magrittr)
gl_talk("The rain in spain falls mainly in the plain",
        output = "output.wav")

gl_talk("Testing my new audio player") %>% gl_talk_player()

# using SSML
gl_talk('<speak>The <say-as interpret-as=\"characters\">SSML</say-as>
  standard <break time=\"1s\"/>is defined by the
  <sub alias=\"World Wide Web Consortium\">W3C</sub>.</speak>',
  inputType =  "ssml")

# using effects profiles
gl_talk("This sounds great on headphones",
        effectsProfileIds = "headphone-class-device")


## End(Not run)

Get a list of voices available for text to speech

Description

Returns a list of voices supported for synthesis.

Usage

gl_talk_languages(languageCode = NULL)

Arguments

languageCode

A BCP-47 language tag. If specified, will only return voices that can be used to synthesize this languageCode

Play audio in a browser

Description

This uses HTML5 audio tags to play audio in your browser

Usage

gl_talk_player(audio = "output.wav", html = "player.html")

Arguments

audio

The file location of the audio file. Must be supported by HTML5

html

The html file location that will be created host the audio

Details

A platform neutral way to play audio is not easy, so this uses your browser to play it instead.

Examples


## Not run: 

gl_talk("Testing my new audio player") %>% gl_talk_player()


## End(Not run)

Speak in Shiny module (server)

Description

Call via shiny::callModule(gl_talk_shiny, "your_id")

Usage

gl_talk_shiny(
  input,
  output,
  session,
  transcript,
  ...,
  autoplay = TRUE,
  controls = TRUE,
  loop = FALSE,
  keep_wav = FALSE
)

Arguments

input

shiny input

output

shiny output

session

shiny session

transcript

The (reactive) text to talk

...

Arguments passed on to gl_talk

languageCode: The language of the voice as a BCP-47 language code
name: Name of the voice, see list via gl_talk_languages for supported voices. Set to NULL to make the service choose a voice based on languageCode and gender.
gender: The gender of the voice, if available
audioEncoding: Format of the requested audio stream
speakingRate: Speaking rate/speed between 0.25 and 4.0
pitch: Speaking pitch between -20.0 and 20.0 in semitones.
volumeGainDb: Volumne gain in dB
sampleRateHertz: Sample rate for returned audio
inputType: Choose between text (the default) or SSML markup. The input text must be SSML markup if you choose ssml
effectsProfileIds: Optional. An identifier which selects 'audio effects' profiles that are applied on (post synthesized) text to speech. Effects are applied on top of each other in the order they are given
forceLanguageCode: If name is provided, this will ensure that the passed languageCode is used instead of being inferred from name. This is necessary for models that require the exact code (en-us, en-gb, ...), not just the two letters shorthand (en, es, ...)

autoplay

passed to the HTML audio player - default TRUE plays on load

controls

passed to the HTML audio player - default TRUE shows controls

loop

passed to the HTML audio player - default FALSE does not loop

keep_wav

keep the generated wav files if TRUE.

Speak in Shiny module (ui)

Description

Speak in Shiny module (ui)

Usage

gl_talk_shinyUI(id)

Arguments

id

The Shiny id

Details

Shiny Module for use with gl_talk_shiny.

Translate the language of text within a request

Description

Translate character vectors via the Google Translate API.

Usage

gl_translate(
  t_string,
  target = "en",
  format = c("text", "html"),
  source = "",
  model = c("nmt", "base")
)

Arguments

t_string

Character vector of text to translate

target

The target language code

format

Whether the text is plain text or HTML

source

Specify the language to translate from. Will detect it if left default

model

Translation model to use

Details

You can translate a vector of strings; if too many for one call, it will be broken up into multiple API calls with smart batching. The API charges per character translated, so splitting does not change cost but may take longer.

If translating HTML, set format = "html". Consider removing anything not needed to be translated first, such as JavaScript or CSS.

API limits: characters per day, characters per 100 seconds, and API requests per 100 seconds. These can be configured in the API manager: https://console.developers.google.com/apis/api/translate.googleapis.com/quotas

Value

A tibble of translatedText, detectedSourceLanguage, and text of length equal to the vector of text you passed in

Detect the language of text within a request

Description

Detect the language of text within a request

Usage

gl_translate_detect(string)

Arguments

string

Character vector of text to detect language for

Details

Consider using library(cld2) and cld2::detect_language instead for offline detection, since that is free and does not require an API call.

gl_translate also returns a detection of the language, so you could optionally use that in one step.

Value

A tibble of the detected languages with columns confidence, isReliable, language, and text, of length equal to the vector of text you passed in.

Examples

## Not run: 
gl_translate_detect("katten sidder på måtten")

## End(Not run)

Translate a document via the Google Translate API

Description

Translate a document via the Google Translate API

Usage

gl_translate_document(
  d_path,
  target = "es-ES",
  output_path = "out.pdf",
  format = c("pdf"),
  source = "en-UK",
  model = c("nmt", "base"),
  location = "global"
)

Arguments

d_path

Path to the document to be translated

target

Target language code (default "es-ES")

output_path

Path where to save the translated document (default "out.pdf")

format

Document format. Currently, only "pdf" is supported

source

Source language code (default "en-UK")

model

Translation model to use ("nmt" or "base")

location

Location for translation API (default "global")

Value

The full path of the translated document

Examples

## Not run: 
gl_translate_document(
  system.file(package = "googleLanguageR", "test-doc.pdf"),
  target = "no"
)

## End(Not run)

Lists languages from Google Translate API

Description

Returns a list of supported languages for translation.

Usage

gl_translate_languages(target = "en")

Arguments

target

A language code for localized language names (default 'en')

Details

Supported language codes generally consist of their ISO 639-1 identifiers (e.g., 'en', 'ja'). In certain cases, BCP-47 codes including language + region identifiers are returned (e.g., 'zh-TW', 'zh-CH').

Value

A tibble of supported languages

Examples

## Not run: 
gl_translate_languages()
gl_translate_languages("da")

## End(Not run)

A helper function that tests whether an object is either NULL _or_ a list of NULLs

Description

A helper function that tests whether an object is either NULL _or_ a list of NULLs

Usage

is.NullOb(x)

Test if a string is a Google Cloud Storage URI

Description

Test if a string is a Google Cloud Storage URI

Usage

is.gcs(x)

Arguments

x

Character string

Value

Logical

Safe cbind that removes data.frames with no rows

Description

Safe cbind that removes data.frames with no rows

Usage

my_cbind(...)

Custom messaging function for package

Description

Custom messaging function for package

Usage

my_message(..., level = 1)

Arguments

...

Messages to print

level

Message level (1=low, 2=debug, 3=normal)

Recursively remove NULL objects from a list

Description

Recursively remove NULL objects from a list

Usage

rmNullObs(x)

googleLanguageR

Description

Details

Author(s)

See Also

Determine API endpoint version

Description

Usage

Arguments

Value

Authenticate with Google Language API services

Description

Usage

Arguments

Details

Examples

Perform Natural Language Analysis

Description

Usage

Arguments

Details

Value

See Also

Examples

Call Google Speech API

Description

Usage

Arguments

Details

Value

AudioEncoding

WordInfo

See Also

Examples

Get a speech operation

Description

Usage

Arguments

Value

See Also

Examples

Perform text to speech

Description

Usage

Arguments

Details

Value

See Also

Examples

Get a list of voices available for text to speech

Description

Usage

Arguments

Play audio in a browser

Description

Usage

Arguments

Details

Examples

Speak in Shiny module (server)

Description

Usage

Arguments

Speak in Shiny module (ui)

Description

Usage

Arguments

Details

Translate the language of text within a request

Description

Usage

Arguments

Details

Value

See Also

Detect the language of text within a request

Description

Usage

Arguments

Details