Every day, large organizations update themselves with the technologies that facilitate and adapt better to each company, facing great challenges that allow them to discover and analyze, in addition to the tools that are used on a day-to-day basis, it is for them that it was created. such as Big Data or in Spanish massive data, which are large-scale data storage systems.
This storage phenomenon is part of the new information and communication technologies. Big Data is the one that occupies all the activities related to the systems that store a large set of data.
One of the main features is that it handles a lot of information, collecting it, classifying it and storing it. The purpose of this collection is to create statistical reports for use by organizations, whether as analysis of business plans, advertising, espionage, among others.
The storage margin has grown over the years, since 2008 the storage level is measured in petabytes to zetabytes of data. Specialists regularly look for new storage measures because there are certain areas where it is necessary to store large amounts of data and existing programs are not very suitable.
There are thousands of tools to create and manage Big Data, but not all are the same, there are three types of Data, which are:
Structured Data: are those in which the data has a very particular structure, such as dates, numbers, among others. An example of them are spreadsheets. Unstructured data: This is usually data that has a specific format and cannot be stored in a spreadsheet, much less manipulate the information, such as PDF documents. Semi-structured data: this type of data does not have a particular format, since it has its own semi-structured metadata, an example of which is HTML codes.