19. D3 @ Alethe
D3 is a team of Data Scientists, Analysts and Developers
at Alethe which takes care of all the Data aggregation,
processing, analytics & visualization operations.
20. Aggregation
Data Aggregation
Identifying the data sources helpful to improve business is the
first step if any data analytics implementation. Data aggregation
includes
• Identifying internal & external data sources like databases,
logs, twitter streams, sensor data
• Aggregating these data sets in big data cluster. Preprocess
data incase it requires cleansing and formatting.
• Create appropriate data store for easy access of data when
required
• Combine static data sets, archives and streaming data under
one unified big data store
21. Processing
Data Processing
Not all data sources publish data in ready to analyze form. Data
processing makes the data ready by
• Identifying all data formats and their compatibility with each
other
• Combine multiple data sets without losing information e.g.
combining multiple log files for running complex analytics job
• Dropping redundant information identified as junk
22. Analytics
Data Analytics
Analytics reveals insight hidden inside data. Data analytics
covers
• Creating algorithms and scripts to analyze huge data sets in
parallel
• Create rules based on historical data analytics that can be
used as-it-is or can be combined with latest data sets to
generate recommendations
• Predictive data analysis to become informed about the
possible future with occurrence probability
• Publish results in format suitable for various visualization
techniques e.g. graphs, interactive map etc.
23. Decision Science
Data Decision Science
Data decision science explores recommendations generated
from analytics and
• Publish suggestive actions and the probability of their
success
• Help business & operations team understand the analytics
recommendations
• Map suggestions to actions with the operational team
24. EDB @ Alethe
Enterprise Data Bag including Apache Hadoop is Big
Data platform for data consumers to process and analyze
large data sets and extract business insights and value
from it.
25. Operations
Operation Management
EDB Data cluster is made for heavy data churning and capable
of running 1000s to parallel data processing and analytics jobs
in parallel. The Operations Manager ensures a bird’s eye view of
all the events/status related to
• Cluster Health
• User Access
• Data Processing Events
• Data Analytics Events
• Error Reporting & Warnings
EDB Operations manager provides easy access through custom
designed UI. Also the events displayed can be customized for
different user types.
26. Data Manager
Data Management
EDB Data Manager gives GUI access for all the tasks related to
• Data import into EDB cluster from external sources like
RDBMS, FTP, Twitter, Logs etc.
• Data export from EDB cluster to external sources
• Data access on user dashboard to large data sets residing in
EDB distributed cluster
• Data access rights management
• Data preview for a user looking to analyze a data set or just
verifying the data
Data Manager provides custom & prebuilt templates to create
data import and export jobs with ease.
27. Analytics
Data Analytics
EDB Data Analytics manager (ADAM™) provides single window
access to all data analytics job irrespective of the type of data
i.e. structured or unstructured data. ADAM takes care of
• Analyzing RDBMS data from multiple databases like SQL,
ORACLE, Postgres etc.
• Analyzing unstructured data sets to find business insights or
• Combine the intelligence from databases with unstructured
data for more accurate results
ADAM comes with prebuilt templates for data analytics jobs. The
templates can be used as-it-is or can be customized for more
complex data analytics.
28. Visualization
Data Visualization
EDB Data Visualization connectors help the business user to
connect the analyzed data sets from the EDB cluster to many
visualization software. User can also visualize data from the
inbuilt visualization libraries of EDB. EDB visualization provides
• Interactive reports with post analytics filters to better
understand the results and even focus on particular output
• Export functionality for analyzed data in various formats like
HTML, PDF, Excel etc.
Data visualization provides pre-built modules that can be further
customized for granular reports.