A Tweets Mining Approach to Detection of Critical Events Characteristics using Random Forest
##plugins.themes.academic_pro.article.main##
Abstract
During a natural disaster, while most people are overwhelmed, governmental agencies are in charge of public safety, and they must timely provide true directions, by quick and ecient analysis of massive amount of information. To guaranty decision making process, it is needed to extract and organize most important information as soon as possible to avoid adding more confusion. In this paper we show an automatic analysis of complex data such as text generated during an emergency event. An unsupervised and recent popular machine learning model called random forest is trained to uncover and organize predominant features from a large set of tweets; providing a hierarchy of main variables which might indicate rules and an approximation of how information ows during an unusual event. In our work we provide, rstly, a conversion process from text to numerical vectors for the training data; secondly, we introduce brie y random forest model; next, we expose an algorithm to adapt random forest to our problem; nally we show results for dierent conguration of the model and conclusions.
##plugins.themes.academic_pro.article.details##
How to Cite
Carlos Enrique Gutierrez, Mohammad Reza Alsharif, Katsumi Yamashita, & Mahdi Khosravy. (2014). A Tweets Mining Approach to Detection of Critical Events Characteristics using Random Forest. International Journal of Next-Generation Computing, 5(2), 167–176. https://doi.org/10.47164/ijngc.v5i2.64
References
- Bosch, A., Zisserman, A., and Munoz, X. 2007. Image classification using random forests and ferns.
- Breiman, L. 2001. Random forests. Machine learning 45, 1, 5–32.
- Breiman, L., Friedman, J., Stone, C. J., and Olshen, R. A. 1984. Classification and regression trees. CRC press.
- Criminisi, A., Shotton, J., and Konukoglu, E. 2012. Decision forests: A unified framework for classification, regression, density estimation, manifold learning and semi-supervised learning. Foundations and Trends R in Computer Graphics and Vision 7, 2–3, 81–227.
- Field, C. B. 2012. Managing the risks of extreme events and disasters to advance climate change adaptation: Special report of the intergovernmental panel on climate change. Cambridge University Press.
- Geurts, P., Ernst, D., and Wehenkel, L. 2006. Extremely randomized trees. Machine learning 63, 1, 3–42.
- Gutierrez, C., Alsharif, M., Cuiwei, H., Villa, R., Yamashita, K., Miyagi, H., and Kurata, K. 2012. Natural disaster online news clustering by self-organizing maps. ishigaki, japan. In 27th SIP symposium.
- Gutierrez, C., Alsharif, M., Villa, R., Yamashita, K., and Miyagi, H. Data pattern discovery on natural disaster news. sapporo, japan. ITC-CSCC, ISBN, 978–4.
- Gutierrez, C. E., Alsharif, M. R., Cuiwei, H., Khosravy, M., Villa, R., Yamashita, K., and Miyagi, H. 2013. Uncover news dynamic by principal component analysis. Shanghai, China, ICIC Express Letters 7, 4, 1245–1250.
- Ho, T. K. 1995. Random decision forests. In Document Analysis and Recognition, 1995., Proceedings of the Third International Conference on. Vol. 1. IEEE, 278–282.
- ISDR, U. 2009. Unisdr terminology on disaster risk reduction. Geneva, Switzerland, May.
- Nowozin, S., Rother, C., Bagon, S., Sharp, T., Yao, B., and Kohli, P. 2011. Decision tree fields. In Computer Vision (ICCV), 2011 IEEE International Conference on. IEEE, 1668–1675.
- Porter, M. F. 1980. An algorithm for suffix stripping. Program: electronic library and information systems 14, 3, 130–137.
- Saffari, A., Leistner, C., Santner, J., Godec, M., and Bischof, H. 2009. On-line random forests. In Computer Vision Workshops (ICCV Workshops), 2009 IEEE 12th International Conference on. IEEE, 1393–1400.