Statistical models for language representation
ONTARE. REVISTA DE INVESTIGACIÓN DE LA FACULTAD DE INGENIERÍA This paper discuses several models for the computational representation of language. First, some n-gram models that are based on Markov models are introduced. Second, a family of models known as the exponential models is taken into account. This family in particular allows the incorporation of several features to model. Third, a recent current of research, the probabilistic Bayesian approach, is discussed. In this kind of models, language is modeled as a probabilistic distribution. Several distributions and probabilistic processes, such as the Dirichlet distribution and the Pitman- Yor process, are used to approximate the linguistic phenomena. Finally, the problem of sparseness o... Ver más
2382-3399
2745-2220
1
2015-10-30
29
39
http://purl.org/coar/access_right/c_abf2
info:eu-repo/semantics/openAccess
Revista Ontare - 2016
id |
3f4f9672bf90e936dfa6cb08d8587fd9 |
---|---|
record_format |
ojs |
spelling |
Statistical models for language representation Universidad Ean Text http://purl.org/coar/access_right/c_abf2 info:eu-repo/semantics/openAccess http://purl.org/coar/version/c_970fb48d4fbd8a85 info:eu-repo/semantics/publishedVersion http://purl.org/redcol/resource_type/ARTREF http://purl.org/coar/resource_type/c_6501 info:eu-repo/semantics/article Revista Ontare - 2016 https://creativecommons.org/licenses/by-nc-sa/4.0/ Español https://journal.universidadean.edu.co/index.php/Revistao/article/view/1208 Revista Ontare Publication application/pdf Núm. 1 , Año 2013 : Avances tecnológicos en ingeniería ONTARE. REVISTA DE INVESTIGACIÓN DE LA FACULTAD DE INGENIERÍA This paper discuses several models for the computational representation of language. First, some n-gram models that are based on Markov models are introduced. Second, a family of models known as the exponential models is taken into account. This family in particular allows the incorporation of several features to model. Third, a recent current of research, the probabilistic Bayesian approach, is discussed. In this kind of models, language is modeled as a probabilistic distribution. Several distributions and probabilistic processes, such as the Dirichlet distribution and the Pitman- Yor process, are used to approximate the linguistic phenomena. Finally, the problem of sparseness of the language and its common solution known as smoothing is discussed.  Dorado, Rubén UNIVERSIDADES -- TRABAJOS DE GRADO-- ONTOLOGIA WEB SEMANTICA Artículo de revista 1 1 Modelos estadísticos para la representación del lenguaje ONTARE. REVISTA DE INVESTIGACIÓN DE LA FACULTAD DE INGENIERÍA Este documento discute varios modelos para la representación computacional del lenguaje. En primer lugar, se introducen los modelos de n-gramas que son basados en los modelos Markov. Luego, se toma en cuenta una familia de modelos conocido como el modelo exponencial. Esta familia en particular permite la incorporación de varias funciones para modelar. Como tercer punto, se discute una corriente reciente de la investigación, el enfoque probabilístico Bayesiano. En este tipo de modelos, el lenguaje es modelado como una distribución probabilística. Se utilizan varias distribuciones y procesos probabilísticos para aproximar los fenómenos lingüísticos, tales como la distribución de Dirichlet y el proceso de Pitman-Yor. Finalmente, se discute el problema de la escasez del lenguaje y su solución más común conocida como smoothing o redistribución. Journal article 39 https://journal.universidadean.edu.co/index.php/Revistao/article/download/1208/1176 2015-10-30 29 2015-10-30T00:00:00Z https://doi.org/10.21158/23823399.v1.n1.2013.1208 10.21158/23823399.v1.n1.2013.1208 2015-10-30T00:00:00Z 2382-3399 2745-2220 |
institution |
UNIVERSIDAD EAN |
thumbnail |
https://nuevo.metarevistas.org/UNIVERSIDADEAN/logo.png |
country_str |
Colombia |
collection |
Revista Ontare |
title |
Statistical models for language representation |
spellingShingle |
Statistical models for language representation Dorado, Rubén UNIVERSIDADES -- TRABAJOS DE GRADO-- ONTOLOGIA WEB SEMANTICA |
title_short |
Statistical models for language representation |
title_full |
Statistical models for language representation |
title_fullStr |
Statistical models for language representation |
title_full_unstemmed |
Statistical models for language representation |
title_sort |
statistical models for language representation |
title_eng |
Modelos estadísticos para la representación del lenguaje |
description |
ONTARE. REVISTA DE INVESTIGACIÓN DE LA FACULTAD DE INGENIERÍA
This paper discuses several models for the computational representation of language. First, some n-gram models that are based on Markov models are introduced. Second, a family of models known as the exponential models is taken into account. This family in particular allows the incorporation of several features to model. Third, a recent current of research, the probabilistic Bayesian approach, is discussed. In this kind of models, language is modeled as a probabilistic distribution. Several distributions and probabilistic processes, such as the Dirichlet distribution and the Pitman- Yor process, are used to approximate the linguistic phenomena. Finally, the problem of sparseness of the language and its common solution known as smoothing is discussed. 
|
description_eng |
ONTARE. REVISTA DE INVESTIGACIÓN DE LA FACULTAD DE INGENIERÍA
Este documento discute varios modelos para la representación computacional del lenguaje. En primer lugar, se introducen los modelos de n-gramas que son basados en los modelos Markov. Luego, se toma en cuenta una familia de modelos conocido como el modelo exponencial. Esta familia en particular permite la incorporación de varias funciones para modelar. Como tercer punto, se discute una corriente reciente de la investigación, el enfoque probabilístico Bayesiano. En este tipo de modelos, el lenguaje es modelado como una distribución probabilística. Se utilizan varias distribuciones y procesos probabilísticos para aproximar los fenómenos lingüísticos, tales como la distribución de Dirichlet y el proceso de Pitman-Yor. Finalmente, se discute el problema de la escasez del lenguaje y su solución más común conocida como smoothing o redistribución.
|
author |
Dorado, Rubén |
author_facet |
Dorado, Rubén |
topicspa_str_mv |
UNIVERSIDADES -- TRABAJOS DE GRADO-- ONTOLOGIA WEB SEMANTICA |
topic |
UNIVERSIDADES -- TRABAJOS DE GRADO-- ONTOLOGIA WEB SEMANTICA |
topic_facet |
UNIVERSIDADES -- TRABAJOS DE GRADO-- ONTOLOGIA WEB SEMANTICA |
citationvolume |
1 |
citationissue |
1 |
citationedition |
Núm. 1 , Año 2013 : Avances tecnológicos en ingeniería |
publisher |
Universidad Ean |
ispartofjournal |
Revista Ontare |
source |
https://journal.universidadean.edu.co/index.php/Revistao/article/view/1208 |
language |
Español |
format |
Article |
rights |
http://purl.org/coar/access_right/c_abf2 info:eu-repo/semantics/openAccess Revista Ontare - 2016 https://creativecommons.org/licenses/by-nc-sa/4.0/ |
type_driver |
info:eu-repo/semantics/article |
type_coar |
http://purl.org/coar/resource_type/c_6501 |
type_version |
info:eu-repo/semantics/publishedVersion |
type_coarversion |
http://purl.org/coar/version/c_970fb48d4fbd8a85 |
type_content |
Text |
publishDate |
2015-10-30 |
date_accessioned |
2015-10-30T00:00:00Z |
date_available |
2015-10-30T00:00:00Z |
url |
https://journal.universidadean.edu.co/index.php/Revistao/article/view/1208 |
url_doi |
https://doi.org/10.21158/23823399.v1.n1.2013.1208 |
issn |
2382-3399 |
eissn |
2745-2220 |
doi |
10.21158/23823399.v1.n1.2013.1208 |
citationstartpage |
29 |
citationendpage |
39 |
url2_str_mv |
https://journal.universidadean.edu.co/index.php/Revistao/article/download/1208/1176 |
_version_ |
1797159010275688448 |