Getting started with amazon redshift - Toronto
-
Upload
amazon-web-services -
Category
Software
-
view
543 -
download
1
Transcript of Getting started with amazon redshift - Toronto
![Page 1: Getting started with amazon redshift - Toronto](https://reader031.fdocuments.us/reader031/viewer/2022030315/587d53ba1a28abee158b521d/html5/thumbnails/1.jpg)
© 2016, Amazon Web Services, Inc. or its Affiliates. All rights reserved.© 2016, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Eugene Kawamoto, Head of Business Development
September 28th, 2016
Getting Started with
Amazon Redshift
![Page 2: Getting started with amazon redshift - Toronto](https://reader031.fdocuments.us/reader031/viewer/2022030315/587d53ba1a28abee158b521d/html5/thumbnails/2.jpg)
Agenda
• Introduction
• Benefits
• Use cases
• Getting started
• Q&A
![Page 3: Getting started with amazon redshift - Toronto](https://reader031.fdocuments.us/reader031/viewer/2022030315/587d53ba1a28abee158b521d/html5/thumbnails/3.jpg)
AnalyzeStore/Process
Amazon
Glacier
Amazon S3
Amazon
DynamoDB
Amazon RDS,
Amazon Aurora
AWS big data portfolio
AWS Data Pipeline
Amazon
CloudSearch
Amazon EMR Amazon EC2
Amazon
Redshift
Amazon
Machine
Learning
Amazon
Elasticsearch
Service
AWS Database
Migration Service
Amazon
QuickSight
Amazon
Kinesis
Firehose
AWS Import/Export
AWS Direct
Connect
Collect
Amazon Kinesis
Streams
![Page 4: Getting started with amazon redshift - Toronto](https://reader031.fdocuments.us/reader031/viewer/2022030315/587d53ba1a28abee158b521d/html5/thumbnails/4.jpg)
Relational data warehouse
Massively parallel; petabyte scale
Fully managed
HDD and SSD platforms
$1,000/TB/year; starts at $0.25/hour
Amazon
Redshift
a lot faster
a lot simpler
a lot cheaper
![Page 5: Getting started with amazon redshift - Toronto](https://reader031.fdocuments.us/reader031/viewer/2022030315/587d53ba1a28abee158b521d/html5/thumbnails/5.jpg)
The Amazon Redshift view of data warehousing
10x cheaper
Easy to provision
Higher DBA productivity
10x faster
No programming
Easily leverage BI tools,
Hadoop, machine learning,
streaming
Analysis inline with process
flows
Pay as you go, grow as you
need
Managed availability and
disaster recovery
Enterprise Big data SaaS
![Page 6: Getting started with amazon redshift - Toronto](https://reader031.fdocuments.us/reader031/viewer/2022030315/587d53ba1a28abee158b521d/html5/thumbnails/6.jpg)
The Forrester Wave™ is copyrighted by Forrester Research, Inc. Forrester and Forrester Wave™ are trademarks of Forrester Research, Inc. The Forrester Wave™ is a graphical
representation of Forrester's call on a market and is plotted using a detailed spreadsheet with exposed scores, weightings, and comments. Forrester does not endorse any
vendor, product, or service depicted in the Forrester Wave. Information is based on best available resources. Opinions reflect judgment at the time and are subject to change.
Forrester Wave™ Enterprise Data Warehouse Q4 ’15
![Page 7: Getting started with amazon redshift - Toronto](https://reader031.fdocuments.us/reader031/viewer/2022030315/587d53ba1a28abee158b521d/html5/thumbnails/7.jpg)
Selected Amazon Redshift customers
![Page 8: Getting started with amazon redshift - Toronto](https://reader031.fdocuments.us/reader031/viewer/2022030315/587d53ba1a28abee158b521d/html5/thumbnails/8.jpg)
Amazon Redshift architecture
Leader node
Simple SQL endpoint
Stores metadata
Optimizes query plan
Coordinates query execution
Compute nodes
Local columnar storage
Parallel/distributed execution of all queries, loads,
backups, restores, resizes
Start at just $0.25/hour, grow to 2 PB (compressed)
DC1: SSD; scale from 160 GB to 326 TB
DS2: HDD; scale from 2 TB to 2 PB
Ingestion/Backup
Backup
Restore
JDBC/ODBC
10 GigE
(HPC)
![Page 9: Getting started with amazon redshift - Toronto](https://reader031.fdocuments.us/reader031/viewer/2022030315/587d53ba1a28abee158b521d/html5/thumbnails/9.jpg)
Benefit #1: Amazon Redshift is fast
Dramatically less I/O
Column storage
Data compression
Zone maps
Direct-attached storage
Large data block sizes
analyze compression listing;
Table | Column | Encoding
---------+----------------+----------
listing | listid | delta
listing | sellerid | delta32k
listing | eventid | delta32k
listing | dateid | bytedict
listing | numtickets | bytedict
listing | priceperticket | delta32k
listing | totalprice | mostly32
listing | listtime | raw
10 | 13 | 14 | 26 |…
… | 100 | 245 | 324
375 | 393 | 417…
… 512 | 549 | 623
637 | 712 | 809 …
… | 834 | 921 | 959
10
324
375
623
637
959
![Page 10: Getting started with amazon redshift - Toronto](https://reader031.fdocuments.us/reader031/viewer/2022030315/587d53ba1a28abee158b521d/html5/thumbnails/10.jpg)
Benefit #1: Amazon Redshift is fast
Parallel and distributed
Query
Load
Export
Backup
Restore
Resize
![Page 11: Getting started with amazon redshift - Toronto](https://reader031.fdocuments.us/reader031/viewer/2022030315/587d53ba1a28abee158b521d/html5/thumbnails/11.jpg)
Benefit #1: Amazon Redshift is fast
Hardware optimized for I/O intensive workloads, 4 GB/sec/node
Enhanced networking, over 1 million packets/sec/node
Choice of storage type, instance size
Regular cadence of autopatched improvements
![Page 12: Getting started with amazon redshift - Toronto](https://reader031.fdocuments.us/reader031/viewer/2022030315/587d53ba1a28abee158b521d/html5/thumbnails/12.jpg)
Benefit #1: Amazon Redshift is fast
New Dense Storage (HDD) instance type
Improved memory 2x, compute 2x, disk throughput 1.5x
Cost: Same as our prior generation!
Performance improvement: 50%
Enhanced I/O and commit improvements (Jan ’16)
Reduce amount of time to commit data
Performance improvement: 35%
![Page 13: Getting started with amazon redshift - Toronto](https://reader031.fdocuments.us/reader031/viewer/2022030315/587d53ba1a28abee158b521d/html5/thumbnails/13.jpg)
Benefit #2: Amazon Redshift is inexpensive
Ds2 (HDD)Price per hour for
DW1.XL single node
Effective annual
price per TB compressed
On-demand $ 0.850 $ 3,725
1 year reservation $ 0.500 $ 2,190
3 year reservation $ 0.228 $ 999
Dc1 (SSD)Price per hour for
DW2.L single node
Effective annual
price per TB compressed
On-demand $ 0.250 $ 13,690
1 year reservation $ 0.161 $ 8,795
3 year reservation $ 0.100 $ 5,500
Pricing is simple
Number of nodes x price/hour
No charge for leader node
No upfront costs
Pay as you go
![Page 14: Getting started with amazon redshift - Toronto](https://reader031.fdocuments.us/reader031/viewer/2022030315/587d53ba1a28abee158b521d/html5/thumbnails/14.jpg)
Benefit #3: Amazon Redshift is fully managed
Continuous/incremental backups
Multiple copies within cluster
Continuous and incremental backups
to Amazon S3
Continuous and incremental backups
across regions
Streaming restore
Amazon S3
Amazon S3
Region 1
Region 2
![Page 15: Getting started with amazon redshift - Toronto](https://reader031.fdocuments.us/reader031/viewer/2022030315/587d53ba1a28abee158b521d/html5/thumbnails/15.jpg)
Benefit #3: Amazon Redshift is fully managed
Amazon S3
Amazon S3
Region 1
Region 2
Fault tolerance
Disk failures
Node failures
Network failures
Availability Zone/region level disasters
![Page 16: Getting started with amazon redshift - Toronto](https://reader031.fdocuments.us/reader031/viewer/2022030315/587d53ba1a28abee158b521d/html5/thumbnails/16.jpg)
Benefit #4: Security is built-in
• Load encrypted from S3
• SSL to secure data in transit
• Amazon VPC for network isolation
• Encryption to secure data at rest
• All blocks on disks and in S3 encrypted
• Block key, cluster key, master key (AES-256)
• On-premises HSM & AWS CloudHSM support
• Audit logging and AWS CloudTrail integration
• SOC 1/2/3, PCI-DSS, FedRAMP, BAA
10 GigE
(HPC)
Ingestion
Backup
Restore
Customer VPC
Internal
VPC
JDBC/ODBC
![Page 17: Getting started with amazon redshift - Toronto](https://reader031.fdocuments.us/reader031/viewer/2022030315/587d53ba1a28abee158b521d/html5/thumbnails/17.jpg)
Benefit #5: We innovate quickly
Well over 100 new features added since launch
Release every two weeks
Automatic patching
Service Launch (2/14)
PDX (4/2)
Temp Credentials (4/11)
DUB (4/25)
SOC1/2/3 (5/8)
Unload Encrypted Files
NRT (6/5)
JDBC Fetch Size (6/27)
Unload logs (7/5)
SHA1 Builtin (7/15)
4 byte UTF-8 (7/18)
Sharing snapshots (7/18)
Statement Timeout (7/22)
Timezone, Epoch, Autoformat (7/25)
WLM Timeout/Wildcards (8/1)
CRC32 Builtin, CSV, Restore Progress (8/9)
Resource Level IAM (8/9)
PCI (8/22)
UTF-8 Substitution (8/29)
JSON, Regex, Cursors (9/10)
Split_part, Audit tables (10/3)
SIN/SYD (10/8)
HSM Support (11/11)
Kinesis EMR/HDFS/SSH copy, Distributed Tables, Audit
Logging/CloudTrail, Concurrency, Resize Perf., Approximate Count Distinct, SNS
Alerts, Cross Region Backup (11/13)
Distributed Tables, Single Node Cursor Support, Maximum Connections to 500
(12/13)
EIP Support for VPC Clusters (12/28)
New query monitoring system tables and diststyle all (1/13)
Redshift on DW2 (SSD) Nodes (1/23)
Compression for COPY from SSH, Fetch size support for single node clusters, new
system tables with commit stats, row_number(), strotol() and query
termination (2/13)
Resize progress indicator & Cluster Version (3/21)
Regex_Substr, COPY from JSON (3/25)
50 slots, COPY from EMR, ECDHE ciphers (4/22)
3 new regex features, Unload to single file, FedRAMP(5/6)
Rename Cluster (6/2)
Copy from multiple regions, percentile_cont, percentile_disc (6/30)
Free Trial (7/1)
pg_last_unload_count (9/15)
AES-128 S3 encryption (9/29)
UTF-16 support (9/29)
![Page 18: Getting started with amazon redshift - Toronto](https://reader031.fdocuments.us/reader031/viewer/2022030315/587d53ba1a28abee158b521d/html5/thumbnails/18.jpg)
Benefit #6: Redshift is powerful
• User defined functions
• Data science
• Machine learning
• Approximate functions
![Page 19: Getting started with amazon redshift - Toronto](https://reader031.fdocuments.us/reader031/viewer/2022030315/587d53ba1a28abee158b521d/html5/thumbnails/19.jpg)
Benefit #7: Amazon Redshift has a large ecosystem
Data integration Systems integratorsBusiness intelligence
![Page 20: Getting started with amazon redshift - Toronto](https://reader031.fdocuments.us/reader031/viewer/2022030315/587d53ba1a28abee158b521d/html5/thumbnails/20.jpg)
Benefit #8: Service oriented architecture
DynamoDB
EMR
S3
EC2/SSH
RDS/Aurora
Amazon
Redshift
Amazon Kinesis
Amazon ML
Data Pipeline
CloudSearch
Amazon
Mobile
Analytics
![Page 21: Getting started with amazon redshift - Toronto](https://reader031.fdocuments.us/reader031/viewer/2022030315/587d53ba1a28abee158b521d/html5/thumbnails/21.jpg)
Use cases
![Page 22: Getting started with amazon redshift - Toronto](https://reader031.fdocuments.us/reader031/viewer/2022030315/587d53ba1a28abee158b521d/html5/thumbnails/22.jpg)
NTT Docomo: Japan’s largest mobile service provider
68 million customers
Tens of TBs per day of data across a
mobile network
6 PB of total data (uncompressed)
Data science for marketing
operations, logistics, and so on
Greenplum on-premises
Scaling challenges
Performance issues
Need same level of security
Need for a hybrid environment
![Page 23: Getting started with amazon redshift - Toronto](https://reader031.fdocuments.us/reader031/viewer/2022030315/587d53ba1a28abee158b521d/html5/thumbnails/23.jpg)
125 node DS2.8XL cluster
4,500 vCPUs, 30 TB RAM
2 PB compressed
10x faster analytic queries
50% reduction in time for new
BI application deployment
Significantly less operations
overhead
Data
Source
ET
AWS
Direct
Connect
Client
Forwarder
LoaderState
Management
SandboxAmazon Redshift
S3
NTT Docomo: Japan’s largest mobile service provider
![Page 24: Getting started with amazon redshift - Toronto](https://reader031.fdocuments.us/reader031/viewer/2022030315/587d53ba1a28abee158b521d/html5/thumbnails/24.jpg)
Nasdaq: powering 100 marketplaces in 50 countries
Orders, quotes, trade executions,
market “tick” data from 7 exchanges
7 billion rows/day
Analyze market share, client activity,
surveillance, billing, and so on
Microsoft SQL Server on-premises
Expensive legacy DW
($1.16 M/yr.)
Limited capacity (1 yr. of data
online)
Needed lower TCO
Must satisfy multiple security
and regulatory requirements
Similar performance
![Page 25: Getting started with amazon redshift - Toronto](https://reader031.fdocuments.us/reader031/viewer/2022030315/587d53ba1a28abee158b521d/html5/thumbnails/25.jpg)
23 node DS2.8XL cluster
828 vCPUs, 5 TB RAM
368 TB compressed
2.7 T rows, 900 B derived
8 tables with 100 B rows
7 man-month migration
¼ the cost, 2x storage, room to
grow
Faster performance, very
secure
Nasdaq: powering 100 marketplaces in 50 countries
![Page 26: Getting started with amazon redshift - Toronto](https://reader031.fdocuments.us/reader031/viewer/2022030315/587d53ba1a28abee158b521d/html5/thumbnails/26.jpg)
Getting started
![Page 27: Getting started with amazon redshift - Toronto](https://reader031.fdocuments.us/reader031/viewer/2022030315/587d53ba1a28abee158b521d/html5/thumbnails/27.jpg)
Provisioning
![Page 28: Getting started with amazon redshift - Toronto](https://reader031.fdocuments.us/reader031/viewer/2022030315/587d53ba1a28abee158b521d/html5/thumbnails/28.jpg)
Enter cluster details
![Page 29: Getting started with amazon redshift - Toronto](https://reader031.fdocuments.us/reader031/viewer/2022030315/587d53ba1a28abee158b521d/html5/thumbnails/29.jpg)
Select node configuration
![Page 30: Getting started with amazon redshift - Toronto](https://reader031.fdocuments.us/reader031/viewer/2022030315/587d53ba1a28abee158b521d/html5/thumbnails/30.jpg)
Select security settings and provision
![Page 31: Getting started with amazon redshift - Toronto](https://reader031.fdocuments.us/reader031/viewer/2022030315/587d53ba1a28abee158b521d/html5/thumbnails/31.jpg)
Point-and-click resize
![Page 32: Getting started with amazon redshift - Toronto](https://reader031.fdocuments.us/reader031/viewer/2022030315/587d53ba1a28abee158b521d/html5/thumbnails/32.jpg)
Resize
• Resize while remaining online
• Provision a new cluster in the
background
• Copy data in parallel from node to
node
• Only charged for source cluster
![Page 33: Getting started with amazon redshift - Toronto](https://reader031.fdocuments.us/reader031/viewer/2022030315/587d53ba1a28abee158b521d/html5/thumbnails/33.jpg)
Data modeling
![Page 34: Getting started with amazon redshift - Toronto](https://reader031.fdocuments.us/reader031/viewer/2022030315/587d53ba1a28abee158b521d/html5/thumbnails/34.jpg)
SELECT COUNT(*) FROM LOGS WHERE DATE = ‘09-JUNE-2013’
MIN: 01-JUNE-2013
MAX: 20-JUNE-2013
MIN: 08-JUNE-2013
MAX: 30-JUNE-2013
MIN: 12-JUNE-2013
MAX: 20-JUNE-2013
MIN: 02-JUNE-2013
MAX: 25-JUNE-2013
Unsorted table
MIN: 01-JUNE-2013
MAX: 06-JUNE-2013
MIN: 07-JUNE-2013
MAX: 12-JUNE-2013
MIN: 13-JUNE-2013
MAX: 18-JUNE-2013
MIN: 19-JUNE-2013
MAX: 24-JUNE-2013
Sorted by date
Zone maps
![Page 35: Getting started with amazon redshift - Toronto](https://reader031.fdocuments.us/reader031/viewer/2022030315/587d53ba1a28abee158b521d/html5/thumbnails/35.jpg)
• Single column
• Compound
• Interleaved
![Page 36: Getting started with amazon redshift - Toronto](https://reader031.fdocuments.us/reader031/viewer/2022030315/587d53ba1a28abee158b521d/html5/thumbnails/36.jpg)
Single Column• Table is sorted by 1 column
Date Region Country
2-JUN-2015 Oceania New Zealand
2-JUN-2015 Asia Singapore
2-JUN-2015 Africa Zaire
2-JUN-2015 Asia Hong Kong
3-JUN-2015 Europe Germany
3-JUN-2015 Asia Korea
[ SORTKEY ( date ) ]
• Best for:
• Queries that use 1st column (i.e. date) as primary filter
• Can speed up joins and group bys
• Quickest to VACUUM
![Page 37: Getting started with amazon redshift - Toronto](https://reader031.fdocuments.us/reader031/viewer/2022030315/587d53ba1a28abee158b521d/html5/thumbnails/37.jpg)
Compound
• Table is sorted by 1st column , then 2nd column etc.
Date Region Country
2-JUN-2015 Africa Zaire
2-JUN-2015 Asia Korea
2-JUN-2015 Asia Singapore
2-JUN-2015 Europe Germany
3-JUN-2015 Asia Hong Kong
3-JUN-2015 Asia Korea
[ SORTKEY COMPOUND ( date, region, country) ]
• Best for:
• Queries that use 1st column as primary filter, then other cols
• Can speed up joins and group bys
• Slower to VACUUM
![Page 38: Getting started with amazon redshift - Toronto](https://reader031.fdocuments.us/reader031/viewer/2022030315/587d53ba1a28abee158b521d/html5/thumbnails/38.jpg)
Interleaved
• Equal weight is given to each column.
Date Region Country
2-JUN-2015 Africa Zaire
3-JUN-2015 Asia Singapore
2-JUN-2015 Asia Korea
2-JUN-2015 Europe Germany
3-JUN-2015 Asia Hong Kong
2-JUN-2015 Asia Korea
[ SORTKEY INTERLEAVED ( date, region, country) ]
• Best for:
• Queries that use different columns in filter
• Queries get faster the more columns used in the filter
• Slowest to VACUUM
![Page 39: Getting started with amazon redshift - Toronto](https://reader031.fdocuments.us/reader031/viewer/2022030315/587d53ba1a28abee158b521d/html5/thumbnails/39.jpg)
• KEY
• ALL
• EVEN
![Page 40: Getting started with amazon redshift - Toronto](https://reader031.fdocuments.us/reader031/viewer/2022030315/587d53ba1a28abee158b521d/html5/thumbnails/40.jpg)
ID Gender Name
101 M John Smith
292 F Jane Jones
139 M Peter Black
446 M Pat Partridge
658 F Sarah Cyan
164 M Brian Snail
209 M James White
306 F Lisa Green
2
3
4
ID Gender Name
101 M John Smith
306 F Lisa Green
ID Gender Name
292 F Jane Jones
209 M James White
ID Gender Name
139 M Peter Black
164 M Brian Snail
ID Gender Name
446 M Pat Partridge
658 F Sarah Cyan
RoundRobin
DISTSTYLE EVEN
![Page 41: Getting started with amazon redshift - Toronto](https://reader031.fdocuments.us/reader031/viewer/2022030315/587d53ba1a28abee158b521d/html5/thumbnails/41.jpg)
ID Gender Name
101 M John Smith
292 F Jane Jones
139 M Peter Black
446 M Pat Partridge
658 F Sarah Cyan
164 M Brian Snail
209 M James White
306 F Lisa Green
HashFunction
ID Gender Name
101 M John Smith
306 F Lisa Green
ID Gender Name
292 F Jane Jones
209 M James White
ID Gender Name
139 M Peter Black
164 M Brian Snail
ID Gender Name
446 M Pat Partridge
658 F Sarah Cyan
DISTSTYLE KEY
![Page 42: Getting started with amazon redshift - Toronto](https://reader031.fdocuments.us/reader031/viewer/2022030315/587d53ba1a28abee158b521d/html5/thumbnails/42.jpg)
ID Gender Name
101 M John Smith
292 F Jane Jones
139 M Peter Black
446 M Pat Partridge
658 F Sarah Cyan
164 M Brian Snail
209 M James White
306 F Lisa Green
HashFunction
ID Gender Name
101 M John Smith
139 M Peter Black
446 M Pat Partridge
164 M Brian Snail
209 M James White
ID Gender Name
292 F Jane Jones
658 F Sarah Cyan
306 F Lisa Green
DISTSTYLE KEY
![Page 43: Getting started with amazon redshift - Toronto](https://reader031.fdocuments.us/reader031/viewer/2022030315/587d53ba1a28abee158b521d/html5/thumbnails/43.jpg)
ID Gender Name
101 M John Smith
292 F Jane Jones
139 M Peter Black
446 M Pat Partridge
658 F Sarah Cyan
164 M Brian Snail
209 M James White
306 F Lisa Green
101 M John Smith
292 F Jane Jones
139 M Peter Black
446 M Pat Partridge
658 F Sarah Cyan
164 M Brian Snail
209 M Lisa Green
306 F James White
101 M John Smith
292 F Jane Jones
139 M Peter Black
446 M Pat Partridge
658 F Sarah Cyan
164 M Brian Snail
209 M Lisa Green
306 F James White
101 M John Smith
292 F Jane Jones
139 M Peter Black
446 M Pat Partridge
658 F Sarah Cyan
164 M Brian Snail
209 M Lisa Green
306 F James White
101 M John Smith
292 F Jane Jones
139 M Peter Black
446 M Pat Partridge
658 F Sarah Cyan
164 M Brian Snail
209 M Lisa Green
306 F James White
ALL
DISTSTYLE ALL
![Page 44: Getting started with amazon redshift - Toronto](https://reader031.fdocuments.us/reader031/viewer/2022030315/587d53ba1a28abee158b521d/html5/thumbnails/44.jpg)
• KEY
• Large Fact tables
• Large dimension tables
• ALL
• Medium dimension tables (1K – 2M)
• EVEN
• Tables with no joins or group by
• Small dimension tables (<1000)
![Page 45: Getting started with amazon redshift - Toronto](https://reader031.fdocuments.us/reader031/viewer/2022030315/587d53ba1a28abee158b521d/html5/thumbnails/45.jpg)
Loading data
![Page 46: Getting started with amazon redshift - Toronto](https://reader031.fdocuments.us/reader031/viewer/2022030315/587d53ba1a28abee158b521d/html5/thumbnails/46.jpg)
AWS CloudCorporate Data center
Amazon S3Amazon
Redshift
Flat files
Data loading options
![Page 47: Getting started with amazon redshift - Toronto](https://reader031.fdocuments.us/reader031/viewer/2022030315/587d53ba1a28abee158b521d/html5/thumbnails/47.jpg)
AWS CloudCorporate Data center
ETL
Source DBs
Amazon
Redshift
Amazon
Redshift
Data loading options
![Page 48: Getting started with amazon redshift - Toronto](https://reader031.fdocuments.us/reader031/viewer/2022030315/587d53ba1a28abee158b521d/html5/thumbnails/48.jpg)
AWS Cloud
Amazon
RedshiftAmazon
Kinesis
Data loading options
![Page 49: Getting started with amazon redshift - Toronto](https://reader031.fdocuments.us/reader031/viewer/2022030315/587d53ba1a28abee158b521d/html5/thumbnails/49.jpg)
Use multiple input files to maximize
throughput
Use the COPY command
Each slice can load one file at a
time
A single input file means only one
slice is ingesting data
Instead of 100MB/s, you’re only
getting 6.25MB/s
![Page 50: Getting started with amazon redshift - Toronto](https://reader031.fdocuments.us/reader031/viewer/2022030315/587d53ba1a28abee158b521d/html5/thumbnails/50.jpg)
Use multiple input files to maximize
throughputUse the COPY command
You need at least as many input files as you have slices
With 16 input files, all slices are working so you maximize throughput
Get 100MB/s per node; scale linearly as you add nodes
![Page 51: Getting started with amazon redshift - Toronto](https://reader031.fdocuments.us/reader031/viewer/2022030315/587d53ba1a28abee158b521d/html5/thumbnails/51.jpg)
Querying
![Page 52: Getting started with amazon redshift - Toronto](https://reader031.fdocuments.us/reader031/viewer/2022030315/587d53ba1a28abee158b521d/html5/thumbnails/52.jpg)
JDBC/ODBC
Amazon Redshift
Amazon Redshift works with your existing
analysis tools
![Page 53: Getting started with amazon redshift - Toronto](https://reader031.fdocuments.us/reader031/viewer/2022030315/587d53ba1a28abee158b521d/html5/thumbnails/53.jpg)
ODBC/JDBC
BI ClientsRedshift
![Page 54: Getting started with amazon redshift - Toronto](https://reader031.fdocuments.us/reader031/viewer/2022030315/587d53ba1a28abee158b521d/html5/thumbnails/54.jpg)
ODBC/JDBC
BI Server Redshift
Clients
![Page 55: Getting started with amazon redshift - Toronto](https://reader031.fdocuments.us/reader031/viewer/2022030315/587d53ba1a28abee158b521d/html5/thumbnails/55.jpg)
Monitor query performance
![Page 56: Getting started with amazon redshift - Toronto](https://reader031.fdocuments.us/reader031/viewer/2022030315/587d53ba1a28abee158b521d/html5/thumbnails/56.jpg)
View query execution plans
![Page 57: Getting started with amazon redshift - Toronto](https://reader031.fdocuments.us/reader031/viewer/2022030315/587d53ba1a28abee158b521d/html5/thumbnails/57.jpg)
Resources
Detail Pages
• http://aws.amazon.com/redshift
• https://aws.amazon.com/marketplace/redshift/
Best Practices
• http://docs.aws.amazon.com/redshift/latest/dg/c_loading-data-best-practices.html
• http://docs.aws.amazon.com/redshift/latest/dg/c_designing-tables-best-practices.html
• http://docs.aws.amazon.com/redshift/latest/dg/c-optimizing-query-performance.html
![Page 58: Getting started with amazon redshift - Toronto](https://reader031.fdocuments.us/reader031/viewer/2022030315/587d53ba1a28abee158b521d/html5/thumbnails/58.jpg)
Thank you!