2022-10-17 08:44:37
Your Embedded Edge Starts Here
Join Us at ARC Processor Summit 2018
Tuesday, September 11, 2018
8:30 a.m. - 6:30 p.m. PDT |
|
DoubleTree by Hilton Hotel
2050 Gateway Place, San Jose, CA 95110
|
This free one-day event consists of multiple tracks in which 草榴社区 experts, ecosystem partners and the ARC user community will deliver technical presentations on a range of topics, including, artificial intelligence (AI), automotive safety, internet of things (IoT), embedded vision and much, much more.
Come and learn about the latest technologies and trends in embedded processor IP, software, programming tools and applications.
Keynote Address
9:15am - 10:15am
Life on the “Edge”
Satyen Yadav, General Manager, IoT Ecosystem Development, Amazon Web Services
Satyen Yadav is a Director and General Manager at Amazon Web Services. He has the global responsibility for developing an ecosystem for AWS’s end-to-end solutions for the Internet of Things, including device software, edge computing software, and associated cloud services. His team has built IoT services such as AWS Greengrass (edge computing), Greengrass ML Inference (machine learning at the edge), Amazon FreeRTOS (IoT OS for MCU class devices), and AWS IoT 1-Click service for ready-to-deploy devices. Satyen has helped customers globally to maximize operational efficiency and deliver unique user experiences through state of the art technologies such as the Internet of Things, IIoT, Machine Learning, Intelligent Edge and Cloud Services, Multi-access Edge Computing (MEC), Industry 4.0, Autonomous Systems, Wireless Devices, and Information & Cyber Security.
Before joining Amazon, Satyen was General Manager of IoT and Wearable 草榴社区 at Intel Corporation and led a global organization to deliver hardware and software solutions for IoT and wearable devices. Before this, he held global leadership positions in engineering and business functions at Intel and helped establish new businesses in wireless networking and cybersecurity areas. Satyen earned MBA from Kellogg School of Management at Northwestern University, and an MS in Computer Science from Iowa State University.
Today, an increasing number of objects are being connected to the internet at an unprecedented rate. As a result, customers are collecting vast amounts of valuable IoT data that was previously unavailable. We are already seeing the benefits of applying machine learning models to process data at the source where it is being generated- farmers predict crop yield, power companies predict energy demand, vehicles can identify distracted drivers, and doctors deliver improved care with real-time insight from medical devices … the possibilities for applying intelligence at the edge are countless. However, processing and analyzing this vast amount of IoT data is not possible with the help of traditional business intelligence tools. In this session, we will showcase how customers can use AWS’s IoT, artificial intelligence, and machine learning services to gain predictive insights and take intelligent, real-time actions on their IoT data, from the cloud to the edge.
Automotive Track
10:30am - 11:15am
The Marriage of AI and Safety in Automotive SoCs
Fergus Casey, Director, R&D, ARC Processors, 草榴社区
Fergus Casey is Director, R&D for ARC Processors at 草榴社区, with responsibility for the ARC EM, EV + CNN, ARC 600 and ARC 700 processor families IP development. He is also the Safety Manager for the ARC Processors. He joined the ARC International R&D team in 2003 and has worked in various roles within the ARC processor group as part of ARC International and through the acquisitions by Virage Logic and later 草榴社区 in 2010. Prior to joining ARC, he worked in a number of fabless semiconductor & IP startups.
Fergus holds a bachelor’s degree in electrical engineering from University College Cork, Ireland.
As the automotive industry looks beyond Level 2 (Driver Assist) designs, the race is on to deliver high-performance safety-critical autonomous vehicle components powered by the latest AI technology. AI techniques can provide increased accuracy for object and pedestrian detection, but these designs must still meet the ISO 26262 standard’s most stringent level of functional safety and fault coverage. In this presentation, we analyze an autonomous driving use-case emphasizing the need for the inseparable union of AI and safety. We will discuss how 草榴社区 achieves this marriage without significant impact on performance, power, or area compared to non-ASIL Ready processors. From an understanding of this use-case and requirements, we present an embedded SoC solution providing the highest level of safety without compromising AI performance.
From architecture through to tape-out, we provide an overview of the design, verification, and safety methodologies required for SoC safety certification.
11:15am - 12:00pm
Reducing the Total Cost of Ownership with Classic and Adaptive AUTOSAR
Dheeraj Sharma, Product Expert, Elektrobit
Dheeraj Sharma is an expert in Automotive Open System Architecture (AUTOSAR) at Elektrobit (EB). Dheeraj has nearly a decade of experience in automotive software and is a certified functional safety engineer with extensive experience in the development of ISO 26262 functional safety software. Prior to joining EB, Dheeraj was a software specialist at iGATE. He holds a bachelor’s degree in engineering from Punjab Technical University, Jalandhar, India. He is based in Farmington Hills, Michigan.
In this talk, we cover the key features of Classic and Adaptive AUTOSAR and how these features reduce the total cost of ownership. Participants should expect to walk away with a high-level overview of AUTOSAR and an understanding of how it provides value for their current and future automotive projects. We focus on the design, configuration, security, and safety aspects that provide the hallmark flexibility of the AUTOSAR platform. Finally, we discuss migration and reuse strategies that match the changing E/E architecture seen within the vehicles.
1:00pm - 1:45pm
Personalize the In-Cabin Experience with Face Recognition and Inference of Driver Emotional States
Vinay M K, Co-founder and Vice President, PathPartner
Vinay M K is the Co-founder and Vice President of products at PathPartner Technologies Pvt Ltd and is responsible for development and deployment of licensable solutions. Prior to PathPartner, he worked as a Technical Manager at Emuzed and an Engineering Manager for Audio at Aricent. His areas of interest include adaptive signal processing, machine learning, audio processing algorithms, speech recognition, python programming, product management and marketing analytics. He has several publications in audio and speech processing domains. Vinay obtained his MBA in Enterprise Management from the Indian Institute of Management Bangalore, his MTech in Signal Processing from the Indian Institute of Technology Kanpur, and his BE in Electronics and Communications from National Institute of Engineering, University of Mysore.
Cars are more than just transportation – they provide entertainment, communication, convenience, and more. The ideal in-cabin experience is personalized to each occupant in the car to enhance aesthetics and entertainment as well as safety and automotive performance. Offering personalized experiences gives car manufacturers a way to differentiate their products from other options in the market. In this presentation, we will discuss how designers and system architects can enhance in-cabin personalization with face recognition, micro radar, iris scanning, and other sensor fusion technologies to implement, for example, driver drowsiness alerts. We will explain how driver and passenger recognition can be the key enabler for personalization targets, and how to provide this functionality with limited impact on power and area. Finally, we will explore the types of CNN networks available for recognizing actions/expressions and how recognition can provide semantic context.
1:45pm - 2:30pm
Taming AI Using Convolutional Neural Networks with Compression and Pruning
Bo Wu, Applications Engineer, 草榴社区
Bo Wu is a corporate applications engineer at 草榴社区 supporting the 草榴社区 EV vision processors. He holds Bachelors and Master’s degrees from Tsinghua University in China, and a Ph.D. from the University of Victoria in Canada. Between 1996 and 2000, he worked as a senior system engineer and DSP engineer at Nortel Networks in Ottawa and AT&T Wireless in Seattle, respectively. Afterwards, he held various engineering and technical marketing positions focusing on system-level design products and processor solutions at Cadence, CoWare, and 草榴社区.
To implement AI applications at the edge, you need to move from training to inference on your embedded target. This transition comes with a new set of considerations to take into account. For the target system, you must consider factors such as performance, memory size, throughput, and bandwidth, as well as maintaining the accuracy of your graph. In this session, we will show how you can profile your graph and then achieve your design targets with techniques such as feature-map compression, graph acceleration and coefficient pruning.
2:30pm - 3:15pm
Optimizing Deep Learning Perception Software for ADAS and Autonomous Driving
Chungbin Heo, Optimization Engineer, StradVision, Inc.
Chungbin is a senior optimization engineer at StradVision with 11 years of experience in computer vision and artificial intelligence software engineering for embedded hardware. Prior to StradVision, he worked for 5 years at Olaworks, Inc. and joined Intel as an acquisition of Olaworks by Intel in 2012. He has wide range of knowledge on integration/optimization for computer vision and deep learning algorithms on various platforms. He has B.S in software engineering at Kwangwoon University in South Korea.
StradVision has developed SVNet, a deep-learning-based object detection of 6 object classes (Pedestrian, Car, Bus, Truck, Box Truck, Two-wheeler). SVNet is robust for bad weather/lighting conditions (e.g., rain, snow, fog, night), small object sizes (e.g., 32 pixel height pedestrian, 20 pixel height vehicle), and occlusion (up to 75% occlusion). In this presentation, I will describe automotive OEM/Tier-1’s requirements (for frontal camera, rear camera, around view monitoring), technical challenges, and how our solution can address them with 草榴社区 EV6x Vision Processors and its dedicated CNN Engine.
3:30pm - 4:15pm
Functional Safety Certification - Your Advantage
Gudrun Neumann, Software Manager & Team Lead for Functional Safety Software Team, SGS-T?V Saar
Gudrun Neumann has worked in the field of Functional Safety of safety-relevant controls with a focus on software in the areas of automotive, machinery, industry, household, and medical devices since 2005.
Ms. Neumann has been Product Manager for Software at SGS-T?V Saar GmbH since 2010 and team leader of the Functional Safety Software Team since 2012. In these roles, she is responsible for standard compliant definition and implementation of analyses and assessments of complex software systems. Based on many years of practice, she offers trainings for application of Functional Safety standards and carries out workshops for safety-relevant software development. Gudrun Neumann studied Informatics at Technical University Munich, where she received a degree as Diplom-Informatikerin (Univ.) in 1990.
The ISO 26262 standard does not require a certification of processes or products. But, certifications can ease your development cycle. In this presentation, we will describe the advantages of process, product and personal certifications using 草榴社区 processes and products as examples. We will also show how providing certificates to customers means offering them valuable additional benefits.
4:15pm - 5:00pm
Extending Control and DSP Performance for Automotive RADAR Applications
Graham Wilson, Sr. Product Marketing Manager, ARC Processors, 草榴社区
Graham has over 25 years in the semiconductor and IP industry, including stints at Motorola (Freescale), Mitel, Sony, Renesas and Tensilica (Cadence). His roles have included chip design, design management and product management, as well as IP product marketing. During his 7-year tenure at Tensilica (and subsequently Cadence), Graham had product marketing responsibility for their DSP cores targeting baseband applications (e.g., 3G, 4G) as well as the definition of new DSP cores targeting IoT (which later become Tensilica’s Fusion DSP product line).
This session will cover object detection and classification techniques using the 77GHz RADAR signal source for automotive advanced driver-assistance system (ADAS) applications. An overview of the RADAR technology used in ADAS applications, including review of the relevant signal processing, typical requirements and design parameters will be given, and some of the design considerations and trade-offs of implementing the RADAR system will be discussed. We will close with and an example of an efficient RADAR signal processing chain (3D, FFT, CFAR, clustering and tracking) algorithm implemented on the ARC HS47D processor, where processing on the ARC DSP core is the forming a data cube (3D FFT) to find range, relative velocity, and direction of arrival (DoA) of objects. In this example, front-end FFT processing is implemented on dedicated hardware blocks, performing 3-stage FFT computation.
IoT/Digital Home Track
10:30am - 11:15am
Addressing the Challenges of Always-on IoT with Efficient Processors for Machine Learning
Pieter van der Wolf, Principal Product Architect, 草榴社区
Pieter van der Wolf is a Principal Product Architect at 草榴社区. He received his MSc and PhD degrees in Electrical Engineering from the Delft University of Technology. He was an Associate Professor at the Delft University of Technology before joining Philips Research in 1996. In 2006 he joined NXP Semiconductors when it was spun out of Philips Electronics. In 2009 he joined Virage Logic, which was subsequently acquired by 草榴社区. He has worked on a broad range of topics including multi-processor architectures and system design methodologies.
The ARC EM DSP processors are extremely well-suited for performing machine learning inference in always-on IoT devices. Smart IoT devices increasingly offer always-on features to allow advanced user control. For example, they can be always-listening to allow control through voice commands, or always-watching to support system wake-up by means of a face trigger. Such always-on functions often employ machine learning techniques for recognizing voice commands, faces, etc. A key requirement for implementing such functions is a very low power consumption, as battery lifetime is key for the always-on capability. In this presentation we discuss the key features of the ARC EM DSP processors that enable efficient machine learning inference. We then show how excellent results, in terms of low MHz requirements, small code size and low power consumption, are achieved for voice trigger and face trigger applications.
11:15am - 12:00pm
Implementing Artificial Intelligence in Embedded Vision, IoT, and Smart Home Applications
Bruno Lavigueur, ASIC Digital Design Engineer, 草榴社区
Bruno brings more than 15 years of experience to his role as Senior R&D Engineer for Embedded Vision Processors at 草榴社区 where he works on system level architecture and hardware design. Prior to 草榴社区, Bruno worked on platform-based automation tools and methodologies for MP-SoCs and system-level architecture at STMicroelectronics. Bruno graduated with a bachelor’s degree in Computer Engineering and a master’s degree in Electrical Engineering from the ?cole Polytechnique de Montréal, Canada.
As neural network techniques are applied to consumer applications, designers must figure out how to introduce these computationally demanding algorithms while minimizing power consumption. This presentation will discuss how to balance the tradeoffs between performance, power, area, and bandwidth in AI applications. It will cover the evolution of CNN graphs, and describe the attributes of popular graphs such as Masknet, ICNET, and RetinaNet for IoT and smart home applications. Finally, it will touch on how an embedded vision processor architecture can maximize computational efficiency without sacrificing accuracy, using facial recognition for the IoT as an example.
1:00pm - 1:45pm
Fast and Ultra-Low Power Graphics Development for Mobile & Embedded Systems
Iakovos Stamoulis, CTO and Co-Founder, Think Silicon
Iakovos Stamoulis has over 20 years of experience in the computer graphics and semiconductor industries. Before founding Think Silicon, he worked for Advanced Rendering Technology in the UK and USA, where he co-engineered the first Ray Tracing Graphics Engine chip. Prior to that he led the engineering team at Atmel's Multimedia and Communication Business Unit. Mr. Stamoulis earned a Ph.D. in 2001 from the Centre for VLSI and Computer Graphics of the University of Sussex, UK and has been a member of IEEE since 1996 and a contributor in the Khronos Group.
High quality CGI (Computer Graphics) and high-resolution display support is proliferating dramatically in embedded, automotive, wearable and IoT devices.
NEMA?|t200 is the latest member of the NEMA|GPU-Family and is a perfect candidate, in combination with ARC Processors, for the acceleration of OpenGL|ES / OpenVG graphics content, providing very high graphics performance even in memory and power resource limited applications. In addition to the GPU, Think Silicon’s focus is to provide a comprehensive SDK including NEMA|Power-Profiler and NEMA|GUI-Builder to support and help developers to optimize their applications for performance and power and to assist them in rapid system deployment. In this presentation Think Silicon will demonstrate the strength of a fully-fledged development system, from RTL implementation to GUI (Graphical User Interface) creation.
1:45pm - 2:30pm
Securing Mobile IoT from Chip to Cloud with Integrated SIM 草榴社区
Michael Moorfield, Head of Technology Strategy and Innovation, Truphone
Michael Moorfield is the Head of Technology Strategy and Innovation for Truphone responsible for driving eSIM ecosystem development and technology partnerships. He is a member of the GSMA eSIM and security accreditation scheme working groups and has led the recent accreditation of Truphone’s Remote SIM Provisioning site in London, UK. Prior to joining Truphone, Michael worked for IBM as a managing consultant for several large multi-national Banking and Telecommunication organizations. Michael holds a bachelor’s degree in Software Engineering from RMIT, Melbourne, Australia.
The SIM card as we know it is disappearing. This session explores how integrating SIM functionality into SoC designs can greatly simplify how mobile IoT connectivity is enabled, secured, and managed. Truphone will demonstrate how its eSIM solution running on an ARC-based security platform, combined with its global connectivity and remote SIM provisioning services is enabling secure, out of the box connectivity from chip to cloud.
2:30pm - 3:15pm
Streaming Low-Power Audio to “Hearable” Devices Using Bluetooth 5
Ron Lowman, Strategic Marketing Manager, 草榴社区
Ron Lowman is the Strategic Marketing Manager for IoT at 草榴社区 responsible for driving the IoT Strategic initiatives working closely with many of the IP Product Marketing Managers. Prior to joining 草榴社区, Ron spent 16 years at Freescale within their MCU division. His background includes stints in strategy, business development, product marketing and engineering roles supporting IC test for automotive engine controllers, and factory automation and controls design. Ron holds a Bachelor of Science degree in Electrical Engineering from The Colorado School of Mines and a Master’s degree in Business Administration from the University of Texas in Austin.
Wireless audio solutions have become pervasive, however, battery life remains a key limitation. The implementation of a standardized audio solution over Bluetooth will revolutionize the adoption and use of hearables not only because of the battery life improvement but because of the new applications Bluetooth Low Energy audio will enable. We will explore the new Low Complexity Communication Codec (LC3), the required IP components, and how those IP components are being optimized for next-generation Bluetooth audio hardware for SoC integration.
3:30pm - 4:15pm
Deploying NB-IoT Communication 草榴社区 with Extensible Processors
Anatoly Savchenkov, Software Engineering Manager, 草榴社区
Anatoly Savchenkov is an R&D Manager at 草榴社区 and is responsible for embedded software running on ARC cores and subsystems. He came to 草榴社区 through acquisitions of Virage Logic and ARC International where he had similar roles. Anatoly holds a Master’s degree in computer science from St. Petersburg Polytechnic University in St. Petersburg, Russia.
NB-IoT is an emerging technology for narrow-band wireless communication standardized by the 3GPP. It was designed with a focus on minimizing the end-user equipment power and performance requirements to enable the widespread deployment of NB-IoT compatible devices and ensure quick technology adoption. It created a market for licensable software-defined IP and increased the demand for efficient and flexible IP cores to execute this software. This presentation highlights the key challenges of NB-IoT modem design and proposes an optimization strategy demonstrating the efficiency of 草榴社区 ARC EMxD family of cores as a platform for software-defined NB-IoT modems.
4:15pm - 5:00pm
Reducing Dynamic Power and Time-to-Tapeout for High-Performance AI Processor SoCs
Yudhan Rajoo, Sr. Technical Marketing Manager, 草榴社区
Yudhan brings more than 8 years’ experience to his role as Senior Technical Marketing Manager for Logic Libraries at 草榴社区. Prior to his current role, Yudhan was a Senior Field Applications Engineer at 草榴社区 supporting logic libraries worldwide and earlier a Design Engineer for IP testchips at ARM. Yudhan has experience implementing complex processor designs on advanced geometries. Yudhan graduated with an honors.
Developing new AI applications requires implementing cutting-edge technologies while meeting performance, power, area and time-to-market requirements. Edge applications require low power, while CNN engines can be especially challenging to a design’s power budget due to the density of the multiply-accumulates needed to run large neural network computations. In addition, designers can face tedious and time-consuming iterations for floor planning and routing in order to meet PPA targets. This presentation will describe how usage of specialized logic cells and memories can address specific RTL-to-GDSII implementation challenges for CNN engines while reducing time-to-tapeout with optimal PPA. We will show a case study describing how utilizing the HPC Design Kit of logic libraries and embedded memories optimized for the 草榴社区 EV61 Embedded Vision Processor resulted in lower power and faster design closure.
Mobile/Storage Track
10:30am - 11:15am
Using Trace Visualization for Efficient Debugging of Embedded Systems
Johan Kraft, CEO, Percepio
Dr. Johan Kraft is the CEO, CTO and founder of Percepio AB, a Swedish company developing advanced trace visualization tools for embedded systems developers. Dr. Kraft holds a Ph.D. in Computer Science and is the original author of the company’s flagship product Tracealyzer.
His earlier academic work focused on practical methods for timing analysis of embedded software, performed in close collaboration with regional industry. Before his doctoral studies, Dr. Kraft developed control software for industrial robots.
Software issues related to timing or resource usage can be difficult since they are not directly visible in the source code, but rather are an emerging property of the system. Such issues call for software tracing. Developers often associate tracing with instruction tracing and overwhelming amounts of low-level information, where you can’t see the forest for all the trees. However, recent advances in trace visualization and software-generated tracing offers an alternative approach that is more suitable for finding anomalies in complex software behavior. This presentation introduces software tracing and demonstrates the potential of state-of-the-art trace visualization, using Percepio Tracealyzer as an example.
11:15am - 12:00pm
High-Performance 草榴社区 for Next-Generation SSD Designs
Michael Thompson, Sr. Product Marketing Manager, ARC Processors, 草榴社区
Michael Thompson is responsible for the definition and marketing of the high-end ARC microprocessor products at 草榴社区. Mike has more than 30 years of experience in both the design and support of microprocessors, microcontrollers, IP cores, and the development of embedded applications and tools. He has worked previously for Actel, MIPS, ZiLOG, Philips/Signetics, and AMD. He has a BSEE from Northern Illinois University and an MBA from Santa Clara University.
Storage is a critical component of the technology enabling online business, information access, streaming video, artificial intelligence and much more. Most of the electronics that we use today wouldn’t be possible without the ever-increasing size and performance that we are seeing from flash storage. This increasing capacity and performance will challenge current methods of maintaining and using data in storage mediums. This is leading to an interest in using artificial intelligence to enable software to dynamically balance and optimize data on SSDs to maximize capacity and throughput. This presentation will investigate how machine learning and other techniques will be used on future SSD designs, and the underlying software and hardware that will make it possible.
1:00pm - 1:45pm
Easing Complex Application Development with Processor and System Trace Resources
Michael Doan, ASIC Digital Design Engineer, 草榴社区
Michael Doan brings more than 20 years’ experience in digital design, system integration, and verification to his role as technical lead for trace and debug for ARC Processors. Before joining 草榴社区, Michael worked on multicore SoCs at AMD, Freescale Semiconductors, and NXP. He has a Bachelor’s degree in electrical engineering from the University of Texas at Austin.
Complex application development is made easier by integrating trace and debug systems that enable observability and the ability to debug early in the design phase, minimizing bug discovery impact, by focusing development resources. This presentation demonstrates the flexibility and breadth of resources provided by ARC Trace 草榴社区 for advanced processor and system development.
1:45pm - 2:30pm
Enabling Ultra-High Performance, Low-Power 5G Modem Designs with Heterogeneous Multicore Systems
Pieter van der Wolf, Principal Product Architect, 草榴社区
Pieter van der Wolf is a Principal Product Architect at 草榴社区. He received his MSc and PhD degrees in Electrical Engineering from the Delft University of Technology. He was an Associate Professor at the Delft University of Technology before joining Philips Research in 1996. In 2006 he joined NXP Semiconductors when it was spun out of Philips Electronics. In 2009 he joined Virage Logic, which was subsequently acquired by 草榴社区. He has worked on a broad range of topics including multi-processor architectures and system design methodologies.
The 5G standard pushes the requirements on wireless communications equipment for greater than 1Gbps data rates as well as reducing system latency, allowing an expansion of 5G use cases to automotive and other timing-critical applications. SoC modem developers for 4G systems previously met performance requirements with heterogeneous systems, using multiple task-specific processor cores. 5G modem SoCs for user equipment (mobile devices) will need to take the heterogeneous implementation further to provide greater performance for higher data rates, larger MIMO configurations and lower latency, while maintaining similar power budgets to 4G modems. This session will go through the range of digital signal processors, controller cores and task-specific cores that will allow 5G modem SoC developers to implement the required amount of programmability/flexibility in their design, while achieving the performance and low power requirements.
2:30pm - 3:15pm
Accelerating Group Theoretic Cryptography with ARC APEX Instructions
Drake Smith, Vice President of Development, SecureRF
Drake Smith has 35 years of experience developing hardware, software, and VLSI solutions for security, automotive, military, and consumer electronics products. He leads a development team which includes embedded software developers, system software developers, and digital logic designers. Mr. Smith holds a Bachelor of Science in Engineering from University of Lowell, and a Master of Software Engineering from Brandeis University.
This presentation will describe how a mathematically efficient cryptographic operation is significantly sped up using the ARC Processor EXtension (APEX) technology. We will give a brief overview of the underlying math operations and how they are implemented in software only. Then we will outline our approach at offloading the most compute-intensive operations onto hardware together with the design process we followed using 草榴社区 ARChitect, MetaWare, and Intel Quartus Prime. Attendees will see a comparison of the resulting performance metrics with APEX versus an assembly language-only implementation.
The design example will use a fast, small-footprint, and low energy digital signature algorithm that has immediate applicability for a wide range of IoT solutions and is ideally suited for applications where an ARC processor may need to securely communicate with an 8- or 16-bit device. Attendees will learn how to incorporate this and other security methods into their own ARC-based designs.
3:30pm - 4:15pm
Building an Embedded Vision Application with a Caffe CNN Model and OpenVX
Jamie Campbell, Software Applications Engineer, Staff, 草榴社区
Jamie Campbell is a Staff Software Engineer at 草榴社区 who leads the Embedded Vision Processor Applications Team, responsible for creating interesting demos and reference applications for the 草榴社区 EV processor. Prior to focusing on embedded vision, Jamie has worked in various capacities as an embedded software specialist, including R&D engineer, Field Applications Engineer and Corporate Applications Engineer at Precise Software Technologies, ARC International, Virage Logic and now 草榴社区. Jamie holds a Bachelor of Science in Electrical Engineering from the University of Calgary, Canada.
You’ve picked your CNN graph and embedded vision processor – now what? In this tutorial, we will walk you through the tool flow required to take your concept to reality, using a Caffe CNN graph and OpenVX kernels with MetaWare EV Development Toolkit, the software development environment for 草榴社区 EV6x processors. We will explain how to use OpenVX to create an image processing graph in conjunction with the CNN engine, as well as covering best practices for optimizing your application for an embedded environment.
4:15pm - 5:00pm
Accelerating AI/Neural Network Performance While Reducing Power in Android Devices
Mischa Jonker, Software Engineer, 草榴社区
Mischa is an embedded software engineer and architect, currently working on the software stack for the 草榴社区 ARC EV6x embedded vision processor family. He has more than 15 years of experience in the embedded software field. His earlier assignments include embedded Linux for various consumer devices, and architect of camera imaging firmware. Mischa holds a Master’s degree in Technical Computer Science from the University of Twente, the Netherlands.
With Android 8.1, Google has added the Neural Network API to the Android mobile operating system. This API is designed to ease offloading neural network workloads to accelerators such as the GPUs, DSPs and other optimized processors such as the 草榴社区 EV6x embedded vision processor. In this presentation, we will show how the Android Neural Network API, TensorFlow Lite and the EV6x processor work together to increase neural network performance while reducing power consumption. Power and performance comparisons between running on an application processor versus the benefits of offloading computations to the EV6x processor will be discussed.
Demos
Real-Time Trace for ARC Processors – Ultra-XD Hardware Trace Probe – Ashling
The Ultra-XD Trace Probe allows you to capture trace from an ARC HS or EM core and upload it to the MetaWare Debugger at gigabit Ethernet speeds. Captured trace can be turned into a “replay” database enabling you to debug your program by executing it both forwards and backwards. Developed in cooperation with 草榴社区, the Ultra-XD probe integrates with the MetaWare or GNU GDB Debuggers under Windows or Linux based hosts.
Debugging ARC with Lauterbach TRACE32 – Lauterbach
Lauterbach will demonstrate debugging on an ARC EM processor target using the TRACE32 debugger including source level code debugging with full on-chip breakpoint support such as program breakpoints, address read/write breakpoints, data breakpoints for memory writes and breakpoints on auxiliary registers.
Morpho Imaging and AI Demos – Morpho
Morpho, an R&D oriented software vendor, delivers cutting-edge image processing and computer vision based technologies to smartphone and other industries and contributes camera quality and functionality competitiveness. Through the partnership in the 草榴社区 ecosystem, Morpho's imaging technologies take advantage of the high-performance and low-power computing capability of the ARC EV processor to be enabled on diverse edge devices. At the ARC Processor Summit 2018, we will exhibit flagship products from our image and video enhancement portfolio, as well as the latest updates in our deep learning and inference technologies.
Face Recognition, Driver Distraction Alert & In-cabin Occupation Monitoring – PathPartner
The generation of cars in development will include face recognition with edge computing and applications to learn driver-specific inattention patterns. Micro-sleep patterns, apnea cycles, reflex latencies, shock responses are all person-specific traits can be learnt by stochastic models. Driver behaviors like parking, talking, laughing and even eating or drinking sequences can all be signatures of the driver and learning them better helps in prediction accuracies and event flagging.
In this demo, we have implemented CNN-based face recognition with high accuracy, along with machine learning based facial landmark detection and tracking. The demo includes features like driver identification, driver distraction, driver drowsiness, emotion classification and in-cabin occupancy monitoring using algorithms optimized to run on embedded platforms like the 草榴社区 EV6x processors.
Percepio Tracealyzer – Percepio
Tracealyzer is the premier trace visualization tool for developers of embedded software systems. Tracealyzer supports several leading real-time operating systems such as Amazon FreeRTOS and ThreadX. In collaboration with 草榴社区, we now introduce Tracealyzer support for profiling of OpenVX applications on 草榴社区 EV6x vision/AI processors.
Walnut DSA Signature Verification Performance, APEX vs. Software-only – SecureRF
This demo highlights the performance advantage achieved by accelerating certain operations with APEX. The demo compares the time it takes (in processor cycles) to verify a digital signature on an ARC EM4 with and without APEX acceleration.
SVNet, Deep-Learning-Based Perception for ADAS and Autonomous Driving – StradVision
StradVision has developed SVNet, a deep-learning-based object detection of 6 object classes (Pedestrian, Car, Bus, Truck, Box Truck, Two-wheeler). SVNet is robust for bad weather/lighting conditions (e.g., rain, snow, fog, night), small object sizes (e.g., 32 pixel height pedestrian, 20 pixel height vehicle), and occlusion (up to 75% occlusion).
Ultra-Low Power Graphics on ARC EM5D and Nema-p GPU Platform – Think Silicon
草榴社区 and Think Silicon developed a prototype sporting a Synpopsys ARC EM5D Processor with a NEMA?- 3D GPU including NEMA?|GFX-API, 9D-Sensor, 5” TFT LCD and is fully battery powered. The solution is aimed for developers to rapidly implement high-quality 3D graphics in connected ultra-low-power wearables and embedded devices with reduced risk and cost.
Verifying System-level Security of the ARC Processor – Tortuga Logic
In this demo, Tortuga Logic will present the results of security analysis performed on the ARC processor using their security simulation software Unison. Several security features are tested, with results displayed in the Unison analysis platform.
Truphone Integrated SIM – Truphone
A demonstration of Truphone’s integrated SIM solution running on the ARC EM Starter Kit board integrated a with a commercial mobile device. It demonstrates the main use-cases of GSMA Remote SIM provisioning for IoT and consumer devices. Using Truphone bootstrap connectivity it will show how in-the-field remote device activation becomes possible for any mobile operator and can completely remove the logistics and costs associated with physical SIM cards.
Android Neural Network Acceleration with EV6x Embedded Vision Processors – 草榴社区
See how the 草榴社区 EV6x Embedded Vision Processor with deep learning can offload application processor tasks to increase performance and reduce power consumption, using an Android Neural Network API example.
Accelerating Software Development and IP Evaluation with 草榴社区 Bluetooth IP – 草榴社区
This demo shows how to integrate Bluetooth 5 and Bluetooth Mesh capabilities into your IoT SoCs, and get a jump start on your next design with industry proven open source software. The demonstration features a complete solution containing 草榴社区 Bluetooth Low Energy PHY and Link Layer IP and 草榴社区 ARC EM Processor.
ARC Software Development at Full Speed - 草榴社区
The ARC HS Development Kit (HSDK) contains an ASIC implementation of a quad-core ARC HS Processor and a rich set of peripherals including Ethernet, WiFi, Bluetooth, USB, I2C, SPI and UART. It is supported by the tools and software necessary to develop Linux, RTOS and bare-metal applications. Linux distributions can be built for the platform using the open-source Yocto Project and RTOS & bare metal applications can be developed with the embARC Open Software Platform (OSP). This demonstration will show the ARC HSDK in action, running OpenArena, as a great way to get started quickly with software development on ARC processors.
Object Detection & Tracking Using 草榴社区 EV6x Processors – 草榴社区
The 草榴社区 EV6x Embedded Vision Processor’s vector DSP and CNN engine combine deep learning with traditional computer vision. The demo shows how the tightly-integrated CNN engine executes deep learning inference (using TinyYOLO) along with the vector processor, which tracks points in the image using sparse optical flow algorithms.