MKPL MicrosoftSQLMetadatatoCatalog 180219 1849 1876
MKPL MicrosoftSQLMetadatatoCatalog 180219 1849 1876
MKPL MicrosoftSQLMetadatatoCatalog 180219 1849 1876
Short Description This solution approach will leverage the Catalog SQL Server jdbc connector to ingest metadata directly from the
Microsoft SQL Server storage tier.
Overview
Functional Design
Installation
Configuration
Usage
Release History
Troubleshooting
Overview
Microsoft SQL Server is a relational database management system. As a database server, it is a software product with the primary function of
storing and retrieving data as requested by other software applications—which may run either on the same computer or on another computer
across a network. This solution approach will leverage the Catalog SQL Server jdbc connector to ingest metadata directly from the Microsoft SQL
Server storage tier.
MS SQL doc
Functional Design
Asset types used for integration:
Asset Description
type
Database A collection of data that is systematically organised or structured in order to make it easy to create, update and query the
information. Examples: Ora_DGC_V45, SalesDB2020
Schema An organised structure described in a formal language supported by implementing technology that defines the objects in the
technology assets (Table and columns in a relational database, fields in a file). E.g. CRM_001_PRD, HDP_CNT_CLD
Table An implementation of Data Entities in columns and rows, in a given database system. It is the basic structure of a relational
database. Examples: Account_tbl, CUST_ADDR
Column An atomic unit of data that can be stored in a database table. Examples: FST_NM, EMPID
More information is here: JDBC integration metadata model.xlsx.zip
Installation
To perform data source ingestion, JDBC driver should be added to instance.
Configuration
For Microsoft SQL Data source JDBC-driver is added to the product by default.
But there is a possibility to add other JDBC-drivers also:
4. Add schema name (mandatory), description (optional) and select owner (by default, current loged-in user)
5. Click on drop-down under the title JDBC driver version" and select "manage drivers" (more information can be found in "Add and
configure JDBC driver for data source registration")
5.
Usage
1. Open wizard and select "Register Data Source" -> "SQL Server" (more information is in "Registering a data source")
2. Add information about Schema
3. Select Job Server and add needed credentials
4. Now there are possibilities to choose and perform "Store Data Profiling", "Detect advanced data type", "Store Sample Data" or exclude
from the registration process some of the tables.
4.
5. As a final step you can use with the result of ingestion for your purposes.
Release History
v 5.0 -- Initial release. List main features:
10/18/2016
Ingest data
Assemble data sets
Enrich data sets
Shop for data
v 5.4 -- 06/08/ The Collibra Catalog Home page is designed to help you quickly and easily find Catalog-related assets
2018
v 5.6 -- 01/25/ New layout of Catalog pages to improve the user experience (Data sets, Tables, Schemas)
2019 Support for multiple Jobservers: install a Jobserver close to your data sources, even when they are in different network
silos, to increase the performance
An API to store profiling information in Catalog, even if Catalog doesn't natively support the data source.
Troubleshooting