dc.contributor.author | Jain, Shubham | |
dc.contributor.author | de Buitléir, Amy | |
dc.contributor.author | Fallon, Enda | |
dc.date.accessioned | 2020-11-13T11:48:29Z | |
dc.date.available | 2020-11-13T11:48:29Z | |
dc.date.copyright | 2020 | |
dc.date.issued | 2020-03 | |
dc.identifier.citation | Jain, S., de Buitléir, A., Fallon, E. (2020). A review of unstructured data analysis and parsing methods. In 2020 International Conference on Emerging Smart Computing and Informatics (ESCI), Pune, India, 2020, pp. 164-169, doi: 10.1109/ESCI48226.2020.9167588. | en_US |
dc.identifier.other | Conferences - Software Research Institute - AIT | en_US |
dc.identifier.uri | http://research.thea.ie/handle/20.500.12065/3475 | |
dc.description.abstract | Computer applications generate an enormous amount of data every day through their logs, system-generated files or other reports. This generated data depicts the state of the running system and contains abundant information that can be used for system diagnostics and monitoring. Network monitoring systems produce a wide variety of unstructured information, so there is a need for an automated way to extract the relevant data, which currently requires multitude of custom parsers. Developing and testing custom parsers can be time-consuming. Instead, data can be automatically processed and parsed into a machine-readable format, building a generic model for standard or vendor-specific data, and generating insights for analytics, anomaly detection, intrusion detection, node failures and various other applications. This paper reviews some existing approaches for unstructured data mining and parsing and discusses the challenges in information extraction, creation of knowledge bases and presents a generic framework for automatic parsing. | en_US |
dc.format | PDF | en_US |
dc.language.iso | en | en_US |
dc.publisher | IEEE | en_US |
dc.relation.ispartof | 2020 International Conference on Emerging Smart Computing and Informatics (ESCI) | en_US |
dc.rights | Attribution-NonCommercial-NoDerivs 3.0 Ireland | * |
dc.rights.uri | http://creativecommons.org/licenses/by-nc-nd/3.0/ie/ | * |
dc.subject | Data mining | en_US |
dc.subject | Information | en_US |
dc.subject | Extraction | en_US |
dc.subject | Similarity | en_US |
dc.subject | NLP | en_US |
dc.subject | Knowledge base | en_US |
dc.title | A review of unstructured data analysis and parsing methods | en_US |
dc.type | Other | en_US |
dc.contributor.sponsor | Irish Research Council Enterprise Partnership Scheme Postgraduate Scholarship 2020 under Project EPSPG/2020/7 | en_US |
dc.description.peerreview | yes | en_US |
dc.identifier.conference | 2020 International Conference on Emerging Smart Computing and Informatics (ESCI) | |
dc.identifier.doi | doi: 10.1109/ESCI48226.2020.9167588. | |
dc.identifier.orcid | https://orcid.org/0000-0002-8300-5813 | |
dc.rights.access | Open Access | en_US |
dc.subject.department | Software Research Institute AIT | en_US |