🗊Презентация High Performance Deep Learning on Intel Architecture

Нажмите для полного просмотра!
High Performance Deep Learning on Intel Architecture, слайд №1High Performance Deep Learning on Intel Architecture, слайд №2High Performance Deep Learning on Intel Architecture, слайд №3High Performance Deep Learning on Intel Architecture, слайд №4High Performance Deep Learning on Intel Architecture, слайд №5High Performance Deep Learning on Intel Architecture, слайд №6High Performance Deep Learning on Intel Architecture, слайд №7High Performance Deep Learning on Intel Architecture, слайд №8High Performance Deep Learning on Intel Architecture, слайд №9High Performance Deep Learning on Intel Architecture, слайд №10High Performance Deep Learning on Intel Architecture, слайд №11High Performance Deep Learning on Intel Architecture, слайд №12High Performance Deep Learning on Intel Architecture, слайд №13High Performance Deep Learning on Intel Architecture, слайд №14High Performance Deep Learning on Intel Architecture, слайд №15High Performance Deep Learning on Intel Architecture, слайд №16High Performance Deep Learning on Intel Architecture, слайд №17High Performance Deep Learning on Intel Architecture, слайд №18High Performance Deep Learning on Intel Architecture, слайд №19High Performance Deep Learning on Intel Architecture, слайд №20High Performance Deep Learning on Intel Architecture, слайд №21High Performance Deep Learning on Intel Architecture, слайд №22High Performance Deep Learning on Intel Architecture, слайд №23High Performance Deep Learning on Intel Architecture, слайд №24High Performance Deep Learning on Intel Architecture, слайд №25High Performance Deep Learning on Intel Architecture, слайд №26High Performance Deep Learning on Intel Architecture, слайд №27High Performance Deep Learning on Intel Architecture, слайд №28High Performance Deep Learning on Intel Architecture, слайд №29High Performance Deep Learning on Intel Architecture, слайд №30High Performance Deep Learning on Intel Architecture, слайд №31High Performance Deep Learning on Intel Architecture, слайд №32High Performance Deep Learning on Intel Architecture, слайд №33High Performance Deep Learning on Intel Architecture, слайд №34High Performance Deep Learning on Intel Architecture, слайд №35

Вы можете ознакомиться и скачать презентацию на тему High Performance Deep Learning on Intel Architecture. Доклад-сообщение содержит 35 слайдов. Презентации для любого класса можно скачать бесплатно. Если материал и наш сайт презентаций Mypresentation Вам понравились – поделитесь им с друзьями с помощью социальных кнопок и добавьте в закладки в своем браузере.

Слайды и текст этой презентации


Слайд 1





High Performance Deep Learning on Intel® Architecture 
Ivan Kuzmin
Engineering Manager for AI Performance Libraries
December 19, 2016
Описание слайда:
High Performance Deep Learning on Intel® Architecture Ivan Kuzmin Engineering Manager for AI Performance Libraries December 19, 2016

Слайд 2





Fast Evolution of Technology
Описание слайда:
Fast Evolution of Technology

Слайд 3





Classical Machine Learning
Описание слайда:
Classical Machine Learning

Слайд 4





Deep learning
Описание слайда:
Deep learning

Слайд 5





End-to-End Deep Learning
Описание слайда:
End-to-End Deep Learning

Слайд 6





Automating previously “human” tasks
Описание слайда:
Automating previously “human” tasks

Слайд 7





Deep Learning Challenges
Описание слайда:
Deep Learning Challenges

Слайд 8





Deep Learning Challenges
Описание слайда:
Deep Learning Challenges

Слайд 9





Scaling is I/O Bound
Описание слайда:
Scaling is I/O Bound

Слайд 10





Intel Provides the Compute Foundation for DL
Описание слайда:
Intel Provides the Compute Foundation for DL

Слайд 11





INTEL® MKL-DNN
Описание слайда:
INTEL® MKL-DNN

Слайд 12





Deep learning with Intel® MKL-DNN
Описание слайда:
Deep learning with Intel® MKL-DNN

Слайд 13





Deep learning with Intel® MKL-DNN
Описание слайда:
Deep learning with Intel® MKL-DNN

Слайд 14





Deep learning with Intel® MKL-DNN
Описание слайда:
Deep learning with Intel® MKL-DNN

Слайд 15





Intel® Xeon Phi ™ processor 7250  up to 400x performance increase with Intel Optimized Frameworks compared to baseline out of box performance
Описание слайда:
Intel® Xeon Phi ™ processor 7250 up to 400x performance increase with Intel Optimized Frameworks compared to baseline out of box performance

Слайд 16





Intel® Xeon Phi ™ processor Knights Mill up to 4x estimated performance improvement over Intel® Xeon Phi™ processor 7290
Описание слайда:
Intel® Xeon Phi ™ processor Knights Mill up to 4x estimated performance improvement over Intel® Xeon Phi™ processor 7290

Слайд 17





INTEL® Machine Learning Scaling Library
Описание слайда:
INTEL® Machine Learning Scaling Library

Слайд 18





Intel® Machine Learning Scaling Library (MLSL)
Deep learning abstraction of message-passing implementations.
Built on top of MPI, allows other communication libraries to be used
Optimized  to drive scalability of communication patterns
Works across various interconnects: Intel® Omni-Path Architecture, InfiniBand, and Ethernet
Common API to support Deep Learning frameworks (Caffe, Theano, Torch etc.)
Описание слайда:
Intel® Machine Learning Scaling Library (MLSL) Deep learning abstraction of message-passing implementations. Built on top of MPI, allows other communication libraries to be used Optimized to drive scalability of communication patterns Works across various interconnects: Intel® Omni-Path Architecture, InfiniBand, and Ethernet Common API to support Deep Learning frameworks (Caffe, Theano, Torch etc.)

Слайд 19





Intel® Xeon Phi™ Processor 7250 GoogleNet V1 Time-To-Train Scaling Efficiency        up to 97% on 32 nodes
Описание слайда:
Intel® Xeon Phi™ Processor 7250 GoogleNet V1 Time-To-Train Scaling Efficiency up to 97% on 32 nodes

Слайд 20





NeON framework
Описание слайда:
NeON framework

Слайд 21





Neon: DL Framework with Blazing Performance
Описание слайда:
Neon: DL Framework with Blazing Performance

Слайд 22





Intel® Nervana™ Graph Compiler
Intel® Nervana™ Graph Compiler: 
High-level execution graph
for neural networks to enable
optimizations that are applicable 
across multiple HW targets.
Описание слайда:
Intel® Nervana™ Graph Compiler Intel® Nervana™ Graph Compiler: High-level execution graph for neural networks to enable optimizations that are applicable across multiple HW targets.

Слайд 23





Intel® Nervana™ Graph Compiler as the performance building block…
Описание слайда:
Intel® Nervana™ Graph Compiler as the performance building block…

Слайд 24





INTEL® DEEP Learning SDK
Описание слайда:
INTEL® DEEP Learning SDK

Слайд 25





Intel® Deep Learning SDK
Accelerate Your Deep Learning Solution
A free set of tools for data scientists and software developers to develop, train, and deploy deep learning solutions
Описание слайда:
Intel® Deep Learning SDK Accelerate Your Deep Learning Solution A free set of tools for data scientists and software developers to develop, train, and deploy deep learning solutions

Слайд 26





Deep Learning Training Tool
Intel® Deep Learning SDK

Simplify installation of Intel optimized Deep Learning Frameworks
Easy and Visual way to Set-up, Tune and Run Deep Learning Algorithms:
Create training dataset
Design model with automatically optimized hyper-parameters
Launch and monitor training of multiple candidate models
Visualize training performance and accuracy
Описание слайда:
Deep Learning Training Tool Intel® Deep Learning SDK Simplify installation of Intel optimized Deep Learning Frameworks Easy and Visual way to Set-up, Tune and Run Deep Learning Algorithms: Create training dataset Design model with automatically optimized hyper-parameters Launch and monitor training of multiple candidate models Visualize training performance and accuracy

Слайд 27





Deep Learning Deployment Tool
Intel® Deep Learning SDK

Unleash fast scoring performance on Intel products while abstracting the HW from developers
Imports trained models from all popular DL framework regardless of training HW
Compresses model for improved execution, storage & transmission (pruning, quantization)
Generate Inference HW-Specific Code (C/C++, OpenVX, OpenCL, etc.) 
Enables seamless integration with full system / application software stack
Описание слайда:
Deep Learning Deployment Tool Intel® Deep Learning SDK Unleash fast scoring performance on Intel products while abstracting the HW from developers Imports trained models from all popular DL framework regardless of training HW Compresses model for improved execution, storage & transmission (pruning, quantization) Generate Inference HW-Specific Code (C/C++, OpenVX, OpenCL, etc.) Enables seamless integration with full system / application software stack

Слайд 28





Deep Learning Tools for End-to-End Workflow
Intel® Deep Learning SDK
Описание слайда:
Deep Learning Tools for End-to-End Workflow Intel® Deep Learning SDK

Слайд 29





Leading AI research
Описание слайда:
Leading AI research

Слайд 30





Summary
Intel provides highly optimized libraries to accelerate all DL frameworks
Intel® Machine Learning Scaling Library (MLSL) allow to scale DL to 32 nodes and beyond
Nervana graph compiler, next innovation for DL performance
Intel® Deep Learning SDK make it easy for you to start exploring DeepLearning
Intel is committed to provide algorithmic, SW and HW innovations to get best performance for DL on IA
Get more details at:
https://software.intel.com/en-us/ai/deep-learning
Описание слайда:
Summary Intel provides highly optimized libraries to accelerate all DL frameworks Intel® Machine Learning Scaling Library (MLSL) allow to scale DL to 32 nodes and beyond Nervana graph compiler, next innovation for DL performance Intel® Deep Learning SDK make it easy for you to start exploring DeepLearning Intel is committed to provide algorithmic, SW and HW innovations to get best performance for DL on IA Get more details at: https://software.intel.com/en-us/ai/deep-learning

Слайд 31





Legal Disclaimer & Optimization Notice
INFORMATION IN THIS DOCUMENT IS PROVIDED “AS IS”. NO LICENSE, EXPRESS OR IMPLIED, BY ESTOPPEL OR OTHERWISE, TO ANY INTELLECTUAL PROPERTY RIGHTS IS GRANTED BY THIS DOCUMENT. INTEL ASSUMES NO LIABILITY WHATSOEVER AND INTEL DISCLAIMS ANY EXPRESS OR IMPLIED WARRANTY, RELATING TO THIS INFORMATION INCLUDING LIABILITY OR WARRANTIES RELATING TO FITNESS FOR A PARTICULAR PURPOSE, MERCHANTABILITY, OR INFRINGEMENT OF ANY PATENT, COPYRIGHT OR OTHER INTELLECTUAL PROPERTY RIGHT.
Software and workloads used in performance tests may have been optimized for performance only on Intel microprocessors.  Performance tests, such as SYSmark and MobileMark, are measured using specific computer systems, components, software, operations and functions.  Any change to any of those factors may cause the results to vary.  You should consult other information and performance tests to assist you in fully evaluating your contemplated purchases, including the performance of that product when combined with other products. 
Copyright © 2016, Intel Corporation. All rights reserved. Intel, Pentium, Xeon, Xeon Phi, Core, VTune, Cilk, and the Intel logo are trademarks of Intel Corporation in the U.S. and other countries.
Описание слайда:
Legal Disclaimer & Optimization Notice INFORMATION IN THIS DOCUMENT IS PROVIDED “AS IS”. NO LICENSE, EXPRESS OR IMPLIED, BY ESTOPPEL OR OTHERWISE, TO ANY INTELLECTUAL PROPERTY RIGHTS IS GRANTED BY THIS DOCUMENT. INTEL ASSUMES NO LIABILITY WHATSOEVER AND INTEL DISCLAIMS ANY EXPRESS OR IMPLIED WARRANTY, RELATING TO THIS INFORMATION INCLUDING LIABILITY OR WARRANTIES RELATING TO FITNESS FOR A PARTICULAR PURPOSE, MERCHANTABILITY, OR INFRINGEMENT OF ANY PATENT, COPYRIGHT OR OTHER INTELLECTUAL PROPERTY RIGHT. Software and workloads used in performance tests may have been optimized for performance only on Intel microprocessors. Performance tests, such as SYSmark and MobileMark, are measured using specific computer systems, components, software, operations and functions. Any change to any of those factors may cause the results to vary. You should consult other information and performance tests to assist you in fully evaluating your contemplated purchases, including the performance of that product when combined with other products. Copyright © 2016, Intel Corporation. All rights reserved. Intel, Pentium, Xeon, Xeon Phi, Core, VTune, Cilk, and the Intel logo are trademarks of Intel Corporation in the U.S. and other countries.

Слайд 32


High Performance Deep Learning on Intel Architecture, слайд №32
Описание слайда:

Слайд 33





Configuration details
BASELINE: Caffe Out Of the Box, Intel® Xeon Phi™ processor 7250 (68 Cores, 1.4 GHz, 16GB MCDRAM: cache mode), 96GB memory,  Centos 7.2 based on Red Hat* Enterprise Linux 7.2, BVLC-Caffe: https://github.com/BVLC/caffe, with OpenBLAS, Relative performance 1.0
NEW: Caffe: Intel® Xeon Phi™ processor 7250 (68 Cores, 1.4 GHz, 16GB MCDRAM: cache mode), 96GB memory,  Centos 7.2 based on Red Hat* Enterprise Linux 7.2, Intel® Caffe: : https://github.com/intel/caffe based on BVLC Caffe as of Jul 16, 2016, MKL GOLD UPDATE1, Relative performance up to 400x

AlexNet used for both configuration as per https://papers.nips.cc/paper/4824-Large image database-classification-with-deep-convolutional-neural-networks.pdf, Batch Size: 256
Описание слайда:
Configuration details BASELINE: Caffe Out Of the Box, Intel® Xeon Phi™ processor 7250 (68 Cores, 1.4 GHz, 16GB MCDRAM: cache mode), 96GB memory, Centos 7.2 based on Red Hat* Enterprise Linux 7.2, BVLC-Caffe: https://github.com/BVLC/caffe, with OpenBLAS, Relative performance 1.0 NEW: Caffe: Intel® Xeon Phi™ processor 7250 (68 Cores, 1.4 GHz, 16GB MCDRAM: cache mode), 96GB memory, Centos 7.2 based on Red Hat* Enterprise Linux 7.2, Intel® Caffe: : https://github.com/intel/caffe based on BVLC Caffe as of Jul 16, 2016, MKL GOLD UPDATE1, Relative performance up to 400x AlexNet used for both configuration as per https://papers.nips.cc/paper/4824-Large image database-classification-with-deep-convolutional-neural-networks.pdf, Batch Size: 256

Слайд 34





Configuration details
BASELINE: Intel® Xeon Phi™ Processor 7290 (16GB, 1.50 GHz, 72 core) with 192 GB Total Memory on Red Hat Enterprise Linux* 6.7 kernel 2.6.32-573 using MKL 11.3 Update 4, Relative performance 1.0
NEW: Intel® Xeon phi™ processor family – Knights Mill, Relative performance up to 4x
Описание слайда:
Configuration details BASELINE: Intel® Xeon Phi™ Processor 7290 (16GB, 1.50 GHz, 72 core) with 192 GB Total Memory on Red Hat Enterprise Linux* 6.7 kernel 2.6.32-573 using MKL 11.3 Update 4, Relative performance 1.0 NEW: Intel® Xeon phi™ processor family – Knights Mill, Relative performance up to 4x

Слайд 35





Configuration details
Описание слайда:
Configuration details



Похожие презентации
Mypresentation.ru
Загрузить презентацию