Structured, semi-structured, and unstructured data generated by different sources must be converted to a common standard before they can be understood by, and flow among, the diverse systems that process them. Big Data consists of heterogeneous datasets from many sources, and these datasets need to be reduced to the same format for system interoperability. Formatting tools used for this purpose include XML, Avro, JSON, and Parquet.

Discuss the roles of XML, Avro, and JSON, which are the popular data formatting tools in Big Data standardization:
Discuss the need for Big Data standardization.
List the various tools that can be used to achieve Big Data standardization.
What is XML? What is Avro? What is JSON?
Discuss the roles of XML, Avro, and JSON in Big Data formatting.

Useful links: XML, JSON Intro, JSON & Big Data
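
As a starting point for the discussion, the following minimal sketch shows one record expressed in JSON and in XML, plus an Avro schema describing the same fields. It is only an illustration: the record, its field names, and the helper functions to_json, to_xml, and from_xml are all hypothetical, and only the Python standard library is used. The Avro part shows the schema alone (Avro schemas are themselves written in JSON); actual Avro binary encoding would need a third-party library and is not shown.

```python
import json
import xml.etree.ElementTree as ET

# Hypothetical record from one data source; field names are illustrative only.
record = {"sensor_id": "s-17", "temperature_c": 21.5, "ok": True}


def to_json(rec):
    """Serialize the record as JSON text (types such as float and bool survive)."""
    return json.dumps(rec, sort_keys=True)


def to_xml(rec, root_tag="record"):
    """Serialize the record as XML: one child element per field, values as text."""
    root = ET.Element(root_tag)
    for key, value in rec.items():
        child = ET.SubElement(root, key)
        child.text = str(value)
    return ET.tostring(root, encoding="unicode")


def from_xml(text):
    """Parse the XML back into a dict. Every value comes back as a string,
    because plain XML carries no type information without a schema."""
    root = ET.fromstring(text)
    return {child.tag: child.text for child in root}


# An Avro schema for the same record, written as JSON (assumed field types).
avro_schema = {
    "type": "record",
    "name": "SensorReading",
    "fields": [
        {"name": "sensor_id", "type": "string"},
        {"name": "temperature_c", "type": "double"},
        {"name": "ok", "type": "boolean"},
    ],
}

json_text = to_json(record)
xml_text = to_xml(record)

print(json_text)
print(xml_text)
print(json.dumps(avro_schema, indent=2))
```

One point the sketch makes visible: JSON keeps basic types on a round trip, plain XML returns every value as text unless a schema is applied, and Avro states the types up front in its schema. That difference is part of what each format contributes to standardization.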