Lexical frequency of CCV onset clusters in Brazilian Portuguese: comparing adult speech, child directed speech and child speech in the open corpora FI and FDC

Bibliographic details
Main author: Toni, Andressa
Publication date: 2021
Document type: Article
Language: Portuguese (por)
Source title: Revista da ABRALIN (Online)
Full text: https://revista.abralin.org/index.php/abralin/article/view/1801
Abstract: This paper introduces to the linguistic community a new resource for Language Acquisition studies: the Child Speech Corpus (Corpus FI) and the Child Directed Speech Corpus (Corpus FDC). We built these corpora from the naturalistic database of Santos (2005) using the computational tools of Benevides and Guide (2016). The corpora consist of a frequency list in which researchers can find phonological and morphological information (phonological transcription, stress transcription, syllabic structure, stress category, lexical category, lemma) extracted from the speech productions of 3 children (Corpus FI) and their mothers/caregivers (Corpus FDC). The goal of the paper is i) to describe the methods used in compiling the corpora, providing a basic usage guide; and ii) to show how these data can contribute to language development research. To that end, we compare the segmental and prosodic frequencies of CCV syllables (Consonant1+Consonant2+Vowel) in adult speech, child directed speech and child speech, establishing how input frequencies influence children’s phonological acquisition path. Results point to a similarity in CCV’s prosodic and segmental properties across the three corpora. CCV is mostly realized in prosodically salient positions and is usually restricted to the same consonant sequences. Given CCV’s low frequency of use, low minimal-pair count and phonologically opaque contexts, we claim that input frequency is a factor contributing to the long acquisition path of this syllable type, which emerges before age 2;0 and is acquired only between ages 5;0 and 6;0.
id UFPR-12_0131447cb42f3cd82ce96db002f25974
oai_identifier_str oai:ojs.revista.ojs.abralin.org:article/1801
network_acronym_str UFPR-12
network_name_str Revista da ABRALIN (Online)
repository_id_str
spelling Abstract (Portuguese version, in English translation): This article presents to the linguistic community the Child Speech Corpus (Corpus FI) and the Child Directed Speech Corpus (Corpus FDC), a new database for Language Acquisition studies. These corpora were compiled from the longitudinal database of Santos (2005) using the computational tools of Benevides and Guide (2016). The corpora consist of a frequency list containing phonological information (phonological transcription, stress transcription, syllabic structure, stress category) and morphological information (lexical category and lemma) for the words collected in the speech of 3 children (Corpus FI) and of their caregivers (Corpus FDC). To publicize these open-access corpora, this article i) describes the methodology used in their compilation and handling; and ii) offers an example of how these corpora can contribute to research on children's linguistic development. To that end, we compare the segmental and prosodic frequencies of CCV syllables (Consonant1+Consonant2+Vowel) in adult speech, child directed speech and child speech, demonstrating how input frequency influences the path of phonological acquisition. The results indicate congruence in the prosodic and segmental composition of the corpora, with CCV mostly occupying prosodically salient positions and concentrating in specific consonant sequences. Given the low overall frequency of CCV, the low number of CV-CCV minimal pairs and the existence of contexts of low phonological transparency, we argue that the input is a factor contributing to the long acquisition path of this syllable type, which emerges in child speech before age 2;0 and stabilizes only between ages 5;0 and 6;0.
2178-7603
1678-1805
https://revista.abralin.org/index.php/abralin/oai
2021-06-25T12:41:24Z
dc.title.none.fl_str_mv Lexical frequency of CCV onset clusters in Brazilian Portuguese: comparing adult speech, child directed speech and child speech in the open corpora FI and FDC
Frequência lexical dos ataques ramificados CCV em Português Brasileiro: comparando a fala adulta, a fala dirigida à criança e a fala infantil nos corpora FI e FDC
author Toni, Andressa
author_facet Toni, Andressa
author_role author
dc.contributor.author.fl_str_mv Toni, Andressa
dc.subject.por.fl_str_mv Linguística de Corpus
Fala infantil
Fala dirigida à criança
Ataques ramificados
Corpus Linguistics
Child Speech
Child Directed Speech
Onset clusters
publishDate 2021
dc.date.none.fl_str_mv 2021-06-01
dc.type.driver.fl_str_mv info:eu-repo/semantics/article
info:eu-repo/semantics/publishedVersion
text
info:eu-repo/semantics/other
format article
status_str publishedVersion
dc.identifier.uri.fl_str_mv https://revista.abralin.org/index.php/abralin/article/view/1801
10.25189/rabralin.v20i1.1801
url https://revista.abralin.org/index.php/abralin/article/view/1801
identifier_str_mv 10.25189/rabralin.v20i1.1801
dc.language.iso.fl_str_mv por
language por
dc.relation.none.fl_str_mv https://revista.abralin.org/index.php/abralin/article/view/1801/2174
https://revista.abralin.org/index.php/abralin/article/view/1801/2175
dc.rights.driver.fl_str_mv Copyright (c) 2021 Andressa Toni
info:eu-repo/semantics/openAccess
rights_invalid_str_mv Copyright (c) 2021 Andressa Toni
eu_rights_str_mv openAccess
dc.format.none.fl_str_mv application/pdf
text/xml
dc.publisher.none.fl_str_mv Associação Brasileira de Linguística
dc.source.none.fl_str_mv Revista da ABRALIN; V. 20, N. 1 (2021); 1-33
0102-7158
10.25189/rabralin.v20i1
reponame:Revista da ABRALIN (Online)
instname:Universidade Federal do Paraná (UFPR)
instacron:UFPR
instname_str Universidade Federal do Paraná (UFPR)
instacron_str UFPR
institution UFPR
reponame_str Revista da ABRALIN (Online)
collection Revista da ABRALIN (Online)
repository.name.fl_str_mv Revista da ABRALIN (Online) - Universidade Federal do Paraná (UFPR)
repository.mail.fl_str_mv rkofreitag@uol.com.br || ra@abralin.org
_version_ 1798329771807997952