For many securities organizations, post-trade processing is expensive, cumbersome, and time-consuming. This is in part due to the massive volumes of data required for processing a trade and the limited agility of the technology on which many organizations rely today. In order to create efficiencies and move faster, many financial services organizations are working with AWS to implement post-trade solutions built with AWS storage services (Amazon S3 and Amazon Glacier) and big data capabilities (Amazon Athena, Amazon EMR, Amazon Redshift, and Amazon QuickSight ). In this session, we walk through a trade capture and regulatory reporting solution that uses the aforementioned AWS services. We also provide guidance around obtaining data-driven insights (from pixels to pictures); bolstering encryption with AWS KMS; and maintaining transparency and control with Amazon CloudWatch and Amazon CloudTrail (which also helps meet SEC Rule 613 that requires the creation of comprehensive consolidated audit trails).
8. A More Strategic Approach to Reporting
Financial institutions are viewing their reporting obligations as a catalyst to pursue
broader data management objectives that can help unlock the value of their data.
Business
benefits
Enhanced data
governance
Improved
efficiency
13. CAT Reporting Pipeline on AWS
Business
Intelligence
FIX
Messages
Single
Source of
Truth
Transform
and
Optimize
Optimized
Data
Repository
Transaction
Linking and
Transformation
Regulatory
Report
Ad-hoc Data
Analysis
FIX Ingestion
Transform FIX to Parquet
CAT Reporting
Trade Analytics
14. Region
Multipart
upload of
encrypted
data
Amazon
S3 data
lake
Transient Amazon
EMR Clusters for
ETL
Cleansed,
Formatted,
Split,
Compressed
Output
Internal App
On
premises
On-premises HSM
(optional)
CloudWatch Alarm AWS CloudTrail
Amazon
Glacier
(WORM
storage)
AWS KMS
CAT Reporting Architecture on AWS
BYO Key
Amazon
S3 Data
Warehouse
Transient
Amazon EMR
Clusters for Event
Sequencing
CAT
output
herd Metadata
Store
19. Region
Multipart
upload of
encrypted
data
S3 data
lake
Transient EMR
Clusters for ETL
Cleansed,
Formatted,
Split,
Compressed
Output
Internal App
On
premises
On-premises HSM
(optional)
CloudWatch Alarm CloudTrail
Amazon
Glacier
(WORM
storage)
AWS
KMS
Lineage
BYO Key
S3 Data
Warehouse
Transient EMR
Clusters for Event
Sequencing
CAT
output
herd Metadata
Store
20. Lineage Framework – herd
Unified data catalog
A centralized, auditable
catalog for operational
usage and data governance
Track lineage
Capture data ancestry for
regulatory, forensic, and
analytical purposes
herd is a FINRA-built, open-source framework that tracks and catalogs data in a
unified data repository in order to capture audit and data lineage information
30. Region
Multipart
upload of
encrypted
data
S3 data
lake
Transient EMR
Clusters for ETL
Cleansed,
Formatted,
Split,
Compressed
Output
Internal App
On
premises
On-premises HSM
(optional)
CloudWatch Alarm CloudTrail
Amazon
Glacier
(WORM
storage)
KMS
Reporting and Analytics
BYO Key
S3 Data
Warehouse
Transient EMR
Clusters for Event
Sequencing
CAT
output
HERD Metadata
Store