Title: | Syllabifier for CMU Dictionary Transcriptions |
---|---|
Description: | Implements tidy syllabification of transcription. Based on @kylebgorman's 'python' implementation <https://github.com/kylebgorman/syllabify>. |
Authors: | Josef Fruehwald |
Maintainer: | Josef Fruehwald <[email protected]> |
License: | MIT + file LICENSE |
Version: | 0.1.1 |
Built: | 2025-02-20 03:44:12 UTC |
Source: | https://github.com/jofrhwld/syllabifyr |
This will take a transcription as input, and return it as a data frame.
syllabify(pron, alaska_rule = T)
syllabify(pron, alaska_rule = T)
pron |
The CMU dictionary pronunciation, either as a vector, or a string with labels separated by spaces |
alaska_rule |
Don't maximize onset on lax vowel + s sequences |
Returns a data frame with the following columns
A numeric index for each syllable
What part of the syllable each phone belongs to
The phone label from the transcription
The syllable stress
# String input syllabify("AO0 S T R EY1 L Y AH0") # Vector input syllabify(c("AO0", "S", "T", "R", "EY1", "L", "Y", "AH0")) # Hiatus syllabify("HH AY0 EY1 T AH0 S") # Deficient transcriptions (has warning) syllabify(c("M"))
# String input syllabify("AO0 S T R EY1 L Y AH0") # Vector input syllabify(c("AO0", "S", "T", "R", "EY1", "L", "Y", "AH0")) # Hiatus syllabify("HH AY0 EY1 T AH0 S") # Deficient transcriptions (has warning) syllabify(c("M"))
This will take a transcription as input, and return it as a list.
syllabify_list(pron, alaska_rule = TRUE)
syllabify_list(pron, alaska_rule = TRUE)
pron |
The CMU dictionary pronunciation, either as a vector, or a string with labels separated by spaces |
alaska_rule |
Don't maximize onset on lax vowel + s sequences |
A with one value per syllable. Each value is a list, with three
values: onset, nucleus, coda. Each will contain a vector of the phones
which belong to each constituent part of the syllable. Any empty
constituent parts will have the value character(0)
# String input syllabify_list("AO0 S T R EY1 L Y AH0") # Vector input syllabify_list(c("AO0", "S", "T", "R", "EY1", "L", "Y", "AH0")) # Hiatus syllabify_list("HH AY0 EY1 T AH0 S") # Deficient transcriptions (has warning) syllabify_list(c("M"))
# String input syllabify_list("AO0 S T R EY1 L Y AH0") # Vector input syllabify_list(c("AO0", "S", "T", "R", "EY1", "L", "Y", "AH0")) # Hiatus syllabify_list("HH AY0 EY1 T AH0 S") # Deficient transcriptions (has warning) syllabify_list(c("M"))
This is a package to do tidy syllabification of phonetic transcriptions. The syllabifier "maximizes onset". The algorithmic approach to this is adapted from Kyle Gorman's python implementation (https://github.com/kylebgorman/syllabify)
The key function is syllabify()
. Given a CMU transcription,
it will return a tibble. See ?syllabify()
for more info.
Also available is syllabify_list()
. This is a list representation
of the syllables. See ?syllabify_list()
for more info.