Research Article Open Access

Data Warehouse Design to Support Social Media Analysis in a Big Data Environment 

Carlos Roberto Valêncio1, Luis Marcello Moraes Silva1, William Tenório1, Geraldo Francisco Donegá Zafalon1, Angelo Cesar Colombini2 and Márcio Zamboti Fortes2
  • 1 São Paulo State University (Unesp), Brazil
  • 2 Fluminense Federal University (UFF), Brazil


The volume of generated and stored data from social media has increased in the last decade. Therefore, analyzing and understanding this kind of data can offer relevant information in different contexts and can assist researchers and companies in the decision-making process. However, the data are scattered in a large volume, come from different sources, with different formats and are rapidly created. Such facts make the knowledge extraction difficult, turning it in a complex and high costly process. The scientific contribution of this paper is the development of a social media data integration model based on a data warehouse to reduce the computational costs related to data analysis, as well as support the application of techniques to discover useful knowledge. Differently from the literature, we focus on both social media Facebook and Twitter. Also, we contribute with the proposition of a model for the acquisition, transformation and loading data, which can enable the extraction of useful knowledge in a context where the human capability of understanding is exceeded. The results showed that the proposed data warehouse improves the quality of data mining algorithms compared to related works, while being able to reduce the execution time.

Journal of Computer Science
Volume 16 No. 2, 2020, 126-136


Submitted On: 30 September 2019 Published On: 20 February 2020

How to Cite: Valêncio, C. R., Silva, L. M. M., Tenório, W., Zafalon, G. F. D., Colombini, A. C. & Fortes, M. Z. (2020). Data Warehouse Design to Support Social Media Analysis in a Big Data Environment . Journal of Computer Science, 16(2), 126-136.

  • 2 Citations



  • Social Media
  • Data Warehouse
  • Data Mining
  • Big Data