🗊Презентация Big data concepts and tools

Категория: Информатика
Нажмите для полного просмотра!
Big data concepts and tools, слайд №1Big data concepts and tools, слайд №2Big data concepts and tools, слайд №3Big data concepts and tools, слайд №4Big data concepts and tools, слайд №5Big data concepts and tools, слайд №6Big data concepts and tools, слайд №7Big data concepts and tools, слайд №8Big data concepts and tools, слайд №9Big data concepts and tools, слайд №10Big data concepts and tools, слайд №11Big data concepts and tools, слайд №12Big data concepts and tools, слайд №13Big data concepts and tools, слайд №14Big data concepts and tools, слайд №15Big data concepts and tools, слайд №16

Вы можете ознакомиться и скачать презентацию на тему Big data concepts and tools. Доклад-сообщение содержит 16 слайдов. Презентации для любого класса можно скачать бесплатно. Если материал и наш сайт презентаций Mypresentation Вам понравились – поделитесь им с друзьями с помощью социальных кнопок и добавьте в закладки в своем браузере.

Слайды и текст этой презентации


Слайд 1





big data concepts and tools
PERFORMED BY: Bondarev Petr, 433
Описание слайда:
big data concepts and tools PERFORMED BY: Bondarev Petr, 433

Слайд 2





introduction
The term "Big Data" has launched a veritable industry of processes, personnel and technology to support what appears to be an exploding new field. Giant companies like Amazon and Wal-Mart as well as bodies such as the U.S. government and NASA are using Big Data to meet their business and/or strategic objectives. Big Data can also play a role for small or medium-sized companies and organizations that recognize the possibilities (which can be incredibly diverse) to capitalize upon the gains.
Описание слайда:
introduction The term "Big Data" has launched a veritable industry of processes, personnel and technology to support what appears to be an exploding new field. Giant companies like Amazon and Wal-Mart as well as bodies such as the U.S. government and NASA are using Big Data to meet their business and/or strategic objectives. Big Data can also play a role for small or medium-sized companies and organizations that recognize the possibilities (which can be incredibly diverse) to capitalize upon the gains.

Слайд 3


Big data concepts and tools, слайд №3
Описание слайда:

Слайд 4





Why Are Big Data Systems Different?
					An exact definition of "big data" is 					difficult to nail down because projects, vendors, 			practitioners, and business professionals use it quite 	differently. With that in mind, generally speaking, big data is:
large datasets
the category of computing strategies and technologies that are used to handle large datasets
Описание слайда:
Why Are Big Data Systems Different? An exact definition of "big data" is difficult to nail down because projects, vendors, practitioners, and business professionals use it quite differently. With that in mind, generally speaking, big data is: large datasets the category of computing strategies and technologies that are used to handle large datasets

Слайд 5





Why Are Big Data Systems Different?
The basic requirements for working with big data are the same as the requirements for working with datasets of any size. However, the massive scale, the speed of ingesting and processing, and the characteristics of the data that must be dealt with at each stage of the process present significant new challenges when designing solutions. The goal of most big data systems is to surface insights and connections from large volumes of heterogeneous data that would not be possible using conventional methods.
In 2001, Gartner's Doug Laney first presented what became known as the "three Vs of big data" to describe some of the characteristics that make big data different from other data processing:
Описание слайда:
Why Are Big Data Systems Different? The basic requirements for working with big data are the same as the requirements for working with datasets of any size. However, the massive scale, the speed of ingesting and processing, and the characteristics of the data that must be dealt with at each stage of the process present significant new challenges when designing solutions. The goal of most big data systems is to surface insights and connections from large volumes of heterogeneous data that would not be possible using conventional methods. In 2001, Gartner's Doug Laney first presented what became known as the "three Vs of big data" to describe some of the characteristics that make big data different from other data processing:

Слайд 6


Big data concepts and tools, слайд №6
Описание слайда:

Слайд 7





Other Characteristics
Veracity: The variety of sources and the complexity of the processing can lead to challenges in evaluating the quality of the data (and consequently, the quality of the resulting analysis)
Variability: Variation in the data leads to wide variation in quality. Additional resources may be needed to identify, process, or filter low quality data to make it more useful.
Value: The ultimate challenge of big data is delivering value. Sometimes, the systems and processes in place are complex enough that using the data and extracting actual value can become difficult.
Описание слайда:
Other Characteristics Veracity: The variety of sources and the complexity of the processing can lead to challenges in evaluating the quality of the data (and consequently, the quality of the resulting analysis) Variability: Variation in the data leads to wide variation in quality. Additional resources may be needed to identify, process, or filter low quality data to make it more useful. Value: The ultimate challenge of big data is delivering value. Sometimes, the systems and processes in place are complex enough that using the data and extracting actual value can become difficult.

Слайд 8


Big data concepts and tools, слайд №8
Описание слайда:

Слайд 9





tools
There are thousands of Big Data tools out there for data analysis today. Data analysis is the process of inspecting, cleaning, transforming, and modeling data with the goal of discovering useful information, suggesting conclusions, and supporting decision making.
Описание слайда:
tools There are thousands of Big Data tools out there for data analysis today. Data analysis is the process of inspecting, cleaning, transforming, and modeling data with the goal of discovering useful information, suggesting conclusions, and supporting decision making.

Слайд 10





Great product from Apache that has been used by many large corporations. Among the most important features of this advanced software library is superior processing of voluminous data sets in clusters of computers using effective programming models. Corporations choose Hadoop because of its great processing capabilities plus developer provides regular updates and improvements to the product.
Great product from Apache that has been used by many large corporations. Among the most important features of this advanced software library is superior processing of voluminous data sets in clusters of computers using effective programming models. Corporations choose Hadoop because of its great processing capabilities plus developer provides regular updates and improvements to the product.
Описание слайда:
Great product from Apache that has been used by many large corporations. Among the most important features of this advanced software library is superior processing of voluminous data sets in clusters of computers using effective programming models. Corporations choose Hadoop because of its great processing capabilities plus developer provides regular updates and improvements to the product. Great product from Apache that has been used by many large corporations. Among the most important features of this advanced software library is superior processing of voluminous data sets in clusters of computers using effective programming models. Corporations choose Hadoop because of its great processing capabilities plus developer provides regular updates and improvements to the product.

Слайд 11





This tool is widely used today because it provides an effective management of large amounts of data. It is a database that offers high availability and scalability without compromising the performance of commodity hardware and cloud infrastructure. Among the main advantages of Cassandra highlighted by the development are fault tolerance, performance, decentralization, professional support, durability, elasticity, and scalability. Indeed, such users of Cassandra as eBay and Netflix may prove them.
This tool is widely used today because it provides an effective management of large amounts of data. It is a database that offers high availability and scalability without compromising the performance of commodity hardware and cloud infrastructure. Among the main advantages of Cassandra highlighted by the development are fault tolerance, performance, decentralization, professional support, durability, elasticity, and scalability. Indeed, such users of Cassandra as eBay and Netflix may prove them.
Описание слайда:
This tool is widely used today because it provides an effective management of large amounts of data. It is a database that offers high availability and scalability without compromising the performance of commodity hardware and cloud infrastructure. Among the main advantages of Cassandra highlighted by the development are fault tolerance, performance, decentralization, professional support, durability, elasticity, and scalability. Indeed, such users of Cassandra as eBay and Netflix may prove them. This tool is widely used today because it provides an effective management of large amounts of data. It is a database that offers high availability and scalability without compromising the performance of commodity hardware and cloud infrastructure. Among the main advantages of Cassandra highlighted by the development are fault tolerance, performance, decentralization, professional support, durability, elasticity, and scalability. Indeed, such users of Cassandra as eBay and Netflix may prove them.

Слайд 12





This tool makes the list because of its superior streaming data processing capabilities in real time. It also integrates with many other tools such as Apache Slider to manage and secure the data. The use cases of Storm include data monetization, real time customer management, cyber security analytics, operational dashboards, and threat detection. These functions provide awesome business opportunities.
This tool makes the list because of its superior streaming data processing capabilities in real time. It also integrates with many other tools such as Apache Slider to manage and secure the data. The use cases of Storm include data monetization, real time customer management, cyber security analytics, operational dashboards, and threat detection. These functions provide awesome business opportunities.
Описание слайда:
This tool makes the list because of its superior streaming data processing capabilities in real time. It also integrates with many other tools such as Apache Slider to manage and secure the data. The use cases of Storm include data monetization, real time customer management, cyber security analytics, operational dashboards, and threat detection. These functions provide awesome business opportunities. This tool makes the list because of its superior streaming data processing capabilities in real time. It also integrates with many other tools such as Apache Slider to manage and secure the data. The use cases of Storm include data monetization, real time customer management, cyber security analytics, operational dashboards, and threat detection. These functions provide awesome business opportunities.

Слайд 13





The HPCC platform combines a range of big data analysis tools. It is a package solution with tools for data profiling, cleansing, job scheduling and automation. Like Hadoop, it also leverages commodity computing clusters to provide high-performance, parallel data processing for big data applications.
The HPCC platform combines a range of big data analysis tools. It is a package solution with tools for data profiling, cleansing, job scheduling and automation. Like Hadoop, it also leverages commodity computing clusters to provide high-performance, parallel data processing for big data applications.
It uses ECL (a language specially designed to work with big data) as the scripting language for ETL engine. The HPCC platform supports both parallel batch data processing (Thor) and real-time query applications using indexed data files (Roxie).
Описание слайда:
The HPCC platform combines a range of big data analysis tools. It is a package solution with tools for data profiling, cleansing, job scheduling and automation. Like Hadoop, it also leverages commodity computing clusters to provide high-performance, parallel data processing for big data applications. The HPCC platform combines a range of big data analysis tools. It is a package solution with tools for data profiling, cleansing, job scheduling and automation. Like Hadoop, it also leverages commodity computing clusters to provide high-performance, parallel data processing for big data applications. It uses ECL (a language specially designed to work with big data) as the scripting language for ETL engine. The HPCC platform supports both parallel batch data processing (Thor) and real-time query applications using indexed data files (Roxie).

Слайд 14





Elasticsearch is a dependable and safe open source platform where you can take any data from any source, in any format and search, analyze it and envision it in real time. Elasticsearch is designed for horizontal scalability, reliability, and ease of management.  All of this achieved while combining the speed of search with the potential of analytics. It is based on Lucene a retrieval software library originally compiled in Java. It uses a developer-friendly, JSON-style, query language that works well for structured, unstructured and time-series data.
Elasticsearch is a dependable and safe open source platform where you can take any data from any source, in any format and search, analyze it and envision it in real time. Elasticsearch is designed for horizontal scalability, reliability, and ease of management.  All of this achieved while combining the speed of search with the potential of analytics. It is based on Lucene a retrieval software library originally compiled in Java. It uses a developer-friendly, JSON-style, query language that works well for structured, unstructured and time-series data.
Описание слайда:
Elasticsearch is a dependable and safe open source platform where you can take any data from any source, in any format and search, analyze it and envision it in real time. Elasticsearch is designed for horizontal scalability, reliability, and ease of management.  All of this achieved while combining the speed of search with the potential of analytics. It is based on Lucene a retrieval software library originally compiled in Java. It uses a developer-friendly, JSON-style, query language that works well for structured, unstructured and time-series data. Elasticsearch is a dependable and safe open source platform where you can take any data from any source, in any format and search, analyze it and envision it in real time. Elasticsearch is designed for horizontal scalability, reliability, and ease of management.  All of this achieved while combining the speed of search with the potential of analytics. It is based on Lucene a retrieval software library originally compiled in Java. It uses a developer-friendly, JSON-style, query language that works well for structured, unstructured and time-series data.

Слайд 15





Thanks for your attention!
Описание слайда:
Thanks for your attention!

Слайд 16





Sources
https://www.digitalocean.com/community/tutorials/an-introduction-to-big-data-concepts-and-terminology
https://www.techrepublic.com/blog/big-data-analytics/big-data-basic-concepts-and-benefits-explained/
https://bluewavebuzzblog.wordpress.com/2014/03/20/what-is-big-data-and-how-can-it-help-marketers/
http://bigdata-madesimple.com/top-10-tools-for-working-with-big-data-for-successful-analytics-developers-2/
https://www.newgenapps.com/blog/top-best-open-source-big-data-tools
Описание слайда:
Sources https://www.digitalocean.com/community/tutorials/an-introduction-to-big-data-concepts-and-terminology https://www.techrepublic.com/blog/big-data-analytics/big-data-basic-concepts-and-benefits-explained/ https://bluewavebuzzblog.wordpress.com/2014/03/20/what-is-big-data-and-how-can-it-help-marketers/ http://bigdata-madesimple.com/top-10-tools-for-working-with-big-data-for-successful-analytics-developers-2/ https://www.newgenapps.com/blog/top-best-open-source-big-data-tools



Похожие презентации
Mypresentation.ru
Загрузить презентацию