Chapter 8
Chapter 8
Chapter 8
Roles and
Responsibilities
Skills and Responsibilities
• The most common roles are data engineer, data architect, data
scientist, and analyst.
• Objective is to make this data accessible and broadly usable for a wide
range of analyses.
• data architects often create catalogs for this data to improve its
discoverability and usability.
Data Architect
• Further optimizations to the data and the catalog involve the creation
of naming conventions and standard documentation practices, and
then applying and enforcing these practices.
Skills and methods – Data Architect
• Data architects often work through a user requirements-gathering
process.
• These insights might derive from existing data using advanced statistical
analyses or from the application of machine learning algorithms.
• Both are generally tasked with finding deep and complex insights.
• Additionally, data scientists require the skills to operate the tools that
can apply these algorithms, such as R, SAS, Python, SPSS, and so on.
• In some cases, these take the form of top-line metrics, KPIs, to drive
or orient the organization.
Analyst
• The line between an analyst and a data scientist can be blurry.
• they are able to connect insights that might be co-relevant and then
propose ways to measure the extent of their relationships.
Roles Across the Data Workflow Framework
• 1. Ingesting data
• 2. Describing data
• 3. Assessing data utility
• 4. Designing and building refined data
• 5. Ad hoc reporting
• 6. Exploratory modeling and forecasting
• 7. Designing and building optimized data
• 8. Regular reporting
• 9. Building products and services
• Data engineers, with their focus on data systems, generally drive the
data ingestion and data description in the raw data stage.
• Data architects and data engineers are responsible for designing and
building the optimized datasets.
• Analysts, with the help of the data engineers, drive the reporting efforts.
• Datascientists, also with the help of data engineers, work to deliver the
data for products and service
Organizational Best Practices