Use this identifier to cite or link to this item: http://www.alice.cnptia.embrapa.br/alice/handle/doc/5283
Full metadata record
DC Field | Value | Language
dc.contributor.author | NOGUEIRA, B. M. | pt_BR
dc.contributor.author | MOURA, M. F. | pt_BR
dc.contributor.author | CONRADO, M. da S. | pt_BR
dc.contributor.author | ROSSI, R. G. | pt_BR
dc.contributor.author | MARCACINI, R. M. | pt_BR
dc.contributor.author | REZENDE, S. O. | pt_BR
dc.date.accessioned | 2011-04-10T11:11:11Z | pt_BR
dc.date.available | 2011-04-10T11:11:11Z | pt_BR
dc.date.created | 2008-11-25 | pt_BR
dc.date.issued | 2008 | pt_BR
dc.identifier.citation | In: SIMPÓSIO BRASILEIRO DE BANCO DE DADOS, 23.; SIMPÓSIO BRASILEIRO DE ENGENHARIA DE SOFTWARE, 22.; WORKSHOP EM ALGORITMOS E APLICAÇÕES DE MINERAÇÃO DE DADOS, 4., 2008, Campinas. Anais... Campinas: UNICAMP, Instituto de Computação, 2008. | pt_BR
dc.identifier.uri | http://www.alice.cnptia.embrapa.br/alice/handle/doc/5283 | pt_BR
dc.description | Considering the huge growth of the number of documents in the digital universe and the possibility of obtaining some competitive advantage in processing them, this paper describes some of the difficulties of working with text collections. More specifically, it shows some of the challenges on the step considered one of the most important of the Text Mining process - the data preprocessing - focusing on two of its main tasks: attribute generation and selection, considering not only single terms but composed terms too. In order to overcome the challenges imposed by these problems, this paper presents efficient unsupervised solutions. The application of these solutions in three real data sets is presented in order to evaluate them and to show a way to treat the data step by step. Good results were obtained at the end of the whole process. | pt_BR
dc.language.iso | eng | eng
dc.rights | openAccess | eng
dc.subject | Dados semânticos | pt_BR
dc.subject | Mineração de textos | pt_BR
dc.subject | Text mining | pt_BR
dc.title | Winning some of the document preprocessing challenges in a text mining process. | pt_BR
dc.type | Artigo em anais e proceedings | pt_BR
dc.date.updated | 2020-01-31T11:11:11Z | pt_BR
dc.format.extent2 | p. 10-18. | pt_BR
riaa.ainfo.id | 5283 | pt_BR
riaa.ainfo.lastupdate | 2020-01-31 -02:00:00 | pt_BR
dc.contributor.institution | BRUNO MAGALHÃES NOGUEIRA, USP; MARIA FERNANDA MOURA, CNPTIA; MERLEY DA SILVA CONRADO, USP; RAFAEL GERALDELI ROSSI, USP; RICARDO MARCONDES MARCACINI, USP; SOLANGE OLIVEIRA REZENDE, USP. | pt_BR
Appears in collections: Artigo em anais de congresso (CNPTIA)

Files associated with this item:
File | Description | Size | Format
winning.pdf | | 678.57 kB | Adobe PDF
