Subir material

Suba sus trabajos a SEDICI, para mejorar notoriamente su visibilidad e impacto

 

Mostrar el registro sencillo del ítem

dc.date.accessioned 2022-12-06T15:20:22Z
dc.date.available 2022-12-06T15:20:22Z
dc.date.issued 2022-10-17
dc.identifier.uri http://sedici.unlp.edu.ar/handle/10915/146935
dc.description.abstract The enormous volume of data from different sources, really varied in its typology, generated and processed at great speed, is known as Big Data. The importance of data lies in extracting knowledge from it. Hence, being able to take advantage of a large amount of data allows us to explore and better understand the problems, providing a priori higher quality solutions. To do this, applying Machine Learning for the generation of models is essential, as well as Smart Data so that these models reflect reality and support decision-making. However, it must be noted that the Machine Learning techniques that until now have offered good results are not always able to handle Big Data due to scalability issues. For this reason, they need to be adapted to work in distributed environments, or new techniques or strategies need to be created to deal with this new scenario. In addition, datasets can usually have certain undesired characteristics or complexities that interfere with the effectiveness of the knowledge extraction process, so they must be preprocessed due to the fact that most learning models assume that the data are free of those characteristics. Therefore, and since there are few scalable solutions capable of handling Big Data related to this topic, this thesis addresses the distributed and scalable pre-processing of Big Data sets, in order to obtain good quality data, known as Smart Data. Particularly, it focuses on classification problems, and on addressing the following characteristics: (a) imbalanced data; (b) redundancy; (c) high dimensionality; and (d) overlapping. en
dc.language en es
dc.subject Big Data es
dc.subject Data preprocessing es
dc.subject Machine Learning es
dc.title Analysis and design of scalable pre-processing techniques of instances for imbalanced Big Data problems en
dc.type Articulo es
sedici.identifier.other https://doi.org/10.24215/16666038.22.e15 es
sedici.identifier.issn 1666-6038 es
sedici.title.subtitle Applications in humanitarian emergencies situations en
sedici.creator.person Basgall, María José es
sedici.description.note Resumen de la tesis defendida por el autor en mayo de 2022 en la UNLP. es
sedici.subject.materias Ciencias Informáticas es
sedici.description.fulltext true es
mods.originInfo.place Facultad de Informática es
sedici.subtype Contribucion a revista es
sedici.rights.license Creative Commons Attribution-NonCommercial 4.0 International (CC BY-NC 4.0)
sedici.rights.uri http://creativecommons.org/licenses/by-nc/4.0/
sedici.relation.journalTitle Journal of Computer Science & Technology es
sedici.relation.journalVolumeAndIssue vol. 22, no. 2 es
sedici.relation.isRelatedWith http://sedici.unlp.edu.ar/handle/10915/135846 es


Descargar archivos

Este ítem aparece en la(s) siguiente(s) colección(ones)

Creative Commons Attribution-NonCommercial 4.0 International (CC BY-NC 4.0) Excepto donde se diga explícitamente, este item se publica bajo la siguiente licencia Creative Commons Attribution-NonCommercial 4.0 International (CC BY-NC 4.0)