Understanding Big Data – Analytics for Enterprise Class Hadoop and Streaming Data

来源:互联网 发布:游戏环境检测软件 编辑:程序博客网 时间:2024/05/29 04:36

Big Data is difficultto handle is because it is sitting in its most raw form or in a semi structuredor unstructured format.

On a railway car,these sensors track such things as the conditions experienced by the rail car,the state of individual parts, and GPS-based data for shipment tracking andlogistics. After train derailments that claimed extensive losses of life, governmentsintroduced regulations that this kind of data be stored and analyzed to presentfuture disasters. Rail cars are also becoming more intelligent: processors havebeen added to interpret sensor data on parts prone to wear, such as bearings,to identify parts that need repair before they fail and cause further damage-orworse, disaster.

Threecharacteristics define Big Data: volume, variety, and velocity.

We expect datanumber to reach 35 zettabytes(ZB) by 2020, Twitter alone generates more than 7terabytes(TB) of data every day, Facebook 10 TB, and some enterprises generate terabytesof data every hour of every day of the year.

待续