Summary
This chapter introduced the BigQuery and Data Studio platforms for managing and viewing data. To start, we explored big data and the problems associated with managing large amounts of data. Then we analyzed how to organize data in a structured format so that databases can be queried correctly.
After this, you were introduced to SQL. SQL is a language for defining and manipulating data. As a manipulation language, SQL allows us to select data of interest from the database and to update its content. Queries are used both in the SQL constructs for data definition and in those for updating the database.
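As a reminder of how these pieces fit together, here is a minimal sketch that uses Python's built-in sqlite3 module in place of a production database. The sales table, its columns, and the sample rows are hypothetical and were chosen only for illustration. The sketch defines a table, selects the rows of interest, and then runs an update whose condition is itself a query.

```python
import sqlite3


def run_demo():
    """Define a table, select from it, and update it with a nested query."""
    conn = sqlite3.connect(":memory:")  # throwaway in-memory database
    cur = conn.cursor()

    # Data definition: create the (hypothetical) sales table.
    cur.execute(
        "CREATE TABLE sales (id INTEGER PRIMARY KEY, city TEXT, amount REAL)"
    )
    cur.executemany(
        "INSERT INTO sales (city, amount) VALUES (?, ?)",
        [("Rome", 120.0), ("Milan", 80.0), ("Rome", 45.5)],
    )

    # Data manipulation: select only the rows of interest.
    cur.execute(
        "SELECT id, amount FROM sales WHERE city = ? ORDER BY id", ("Rome",)
    )
    rome_rows = cur.fetchall()

    # Update whose WHERE clause contains a query: raise every amount
    # that is below the average amount by 10 percent.
    cur.execute(
        "UPDATE sales SET amount = amount * 1.1 "
        "WHERE amount < (SELECT AVG(amount) FROM sales)"
    )
    updated = cur.rowcount  # number of rows changed by the UPDATE

    conn.commit()
    conn.close()
    return rome_rows, updated


if __name__ == "__main__":
    rows, updated = run_demo()
    print(rows)
    print(updated)
```

The nested SELECT inside the UPDATE statement is one example of a query being used within an updating construct. The same statements should carry over to other SQL engines with little or no change, although dialect details can differ.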
Then we introduced Google BigQuery. BigQuery is a web service that enables interactive analysis of very large datasets. BigQuery enables companies and developers around the world to manage large amounts of data in real time, without investing in hardware or software of their own.
Finally, we analyzed how to create reports from our data using Google Data Studio. Google Data Studio is a free tool that allows us to create captivating reports quickly and easily. Besides simple tables, Data Studio lets us add customizable charts, with a choice of colors and fonts, that are easy to understand.
In the next chapter, we will present Dataprep, a service for preprocessing data, extracting features, and cleaning up records, and Dataflow, a service for implementing streaming and batch processing.