Amazon Web Services
Big Data Day Camp
Real-Time IoT - MARCH 20-21, 2019
Distinguished Speakers in Communication Theory & Systems
Seminar Series Fall 2018

2018-2019 Schedule

December 3, 2018
Machine Learning on AWS with SageMaker

January 16, 2019
AWS – Building a Data Lake for Analytics and Machine Learning

February 5, 2019
Introduction to AWS IoT with Greengrass

March 6, 2019
AWS SPOT: Cost Effectively Add More Compute Resources

April 9, 2019
How to Set Up Genomics Workflows on the AWS Cloud


How to Set Up Genomics Workflows on the AWS Cloud

April 9, 2019 |  10:30 a.m. – 12:30 p.m.
Atkinson Hall, Qualcomm Institute, UC San Diego

Deriving insights from data is foundational to nearly every organization, and many researches process high volumes of data every day. One common requirement of customers in life sciences is the need to analyze these data in a high-throughput fashion without sacrificing time-to-insight. Such analyses, which tend to be composed of a series of massively parallel processes (MPP) are well suited to the AWS Cloud.

In this session, we will introduce you to AWS and then show you how to set up genomics workflows on AWS. We will also show to users how to optimize Amazon EC2 Spot Instances use and save up to 90% off of traditional On-Demand prices.

Note: this approach to batch processing can be generalized to any type of batch workflow so anyone is welcome to attend.


SageMaker Workshop – Build, Train and Deploy ML Models at Scale with Amazon SageMaker

In this tutorial participants learn to solve Machine / Deep Learning problems using the tools available in the Amazon Web Services (AWS) cloud. The development and application of machine learning models is a vital part of scientific and technical computing. Increasing model training data size generally improves model prediction and performance, but deploying models at scale is a challenge. Participants will learn to use Amazon SageMaker, a new AWS service that simplifies the machine learning process and enables training on cloud stored datasets at any scale.

Introduction to AWS Basics

Learn about core AWS services for compute, storage, database and networking. We will also do a hands-on lab where you will be able to launch AWS virtual machines (EC2 instances) and create your first S3 storage bucket.

AWS Serverless Applications in Python

AWS Serverless Applications in Python: With AWS Serverless computing you can run applications and services without having to provision, scale, and manage any servers. In this workshop, we will introduce the basics of building serverless applications and microservices using services like AWS Lambda, AWS Step Functions, Amazon API Gateway, Amazon DynamoDB, Amazon Kinesis, and Amazon S3. You’ll learn to build and deploy your own serverless application using these services for common use cases like web applications, analytics, and more.

Machine Learning on AWS with SageMaker

Amazon SageMaker is a fully- managed platform that enables developers and data scientists to quickly and easily build, train, and deploy machine learning models at any scale. Amazon SageMaker removes all the barriers that typically slow down developers who want to use machine learning. We will show you how to train and build a ML model on SageMaker then how to deploy the inference end points on tools like AWS Greengrass or Serverless applications.

AWS – Building a Data Lake for Analytics and Machine Learning

In this session, we show you how to understand what data you have, how to drive insights, and how to make predictions using purpose-built AWS services. Learn about the common pitfalls of building data lakes, and discover how to successfully drive analytics and insights from your data. Also learn how services such as Amazon S3, AWS Glue, Amazon Redshift, Amazon Athena, Amazon EMR, Amazon Kinesis, and Amazon AI/ML services work together to build a successful data lake for various roles, including data scientists and business users.

Introduction to AWS IoT with Greengrass

AWS IoT services enable you to easily and securely connect and manage billions of devices. You can gather data from, run sophisticated analytics on, and take actions in real-time on your diverse fleet of IoT devices from edge to the cloud. AWS Greengrass is software that lets you run local compute, messaging, data caching, sync, and ML inference capabilities for connected devices in a secure way.

AWS SPOT: Cost Effectively Add More Compute Resources

With Amazon Web Services (AWS), you can spin up EC2 compute capacity on demand with no upfront commitments. You can do this even more cost effectively by using Amazon EC2 Spot Instances to bid on spare Amazon EC2 computing capacity. This allows users to get 90% off on demand prices (often as little as 1c per core hour) and has helped them run very large scale workloads cost effectively. For example, at USC a computational chemist spun up 156,000 core in three days. Also, with the recent release of the Spot fleet API, a researcher or scientist can easily have access to some of the most cost effective compute capacity at a very large scale. Learn how to effectively use these tools for your research needs.

Contact Us

We're not around right now. But you can send us an email and we'll get back to you, asap.

Not readable? Change text. captcha txt

Start typing and press Enter to search