What is it?

The DataPlatform Franchising project is dedicated to applying the concept of franchising to data assets. To paraphrase Wikipedia's definition of franchising:

For the data franchisor, the franchise is an alternative to building data stores to distribute data assets that avoids the investment and liability of a data store. The franchisor's success depends on the success of the franchisee.

This site documents my adventures while experimenting with this concept.

Premise

The project speculates that DataPlatforms¹ giving access to cleansed datasets that integrate data from multiple third parties will become prevalent in many analytical projects.

This is based on the following considerations:

  1. Data Preparation: this is costly but mandatory when integrating multi-source data, and is better left to a dedicated, specialized team
  2. Cloud-based cost model: with its zero initial investment and pay-per-usage policy, the Cloud will also be attractive to the data analytics domain
  3. Cloud computing scalability: linear and near "infinite" scalability is needed to open analytical capability to a much larger audience
  4. Data asset integration: a single-source dataset has limited value by itself. As the famous saying goes, "The whole is greater than the sum of its parts"
  5. A bit of idealism! The modern world is dense, interconnected and interdependent: striving for more collaboration and dataset sharing should prevail over competitive arguments!

Refer to context for more details.

DataPlatform offering

The project is dedicated to creating DataPlatforms with any kind of integrated dataset. It takes care of all Data Preparation work in place of Data Providers, as well as the visualisation/dashboarding work needed to offer BI analytics as a self-service available to non-experts.

It is the place to find LIVE, CLEANSED, STRUCTURED, MODELED, CONSISTENT and INTEGRATED multi-source datasets accessible in the Cloud! There are many large datasets available, such as BigQuery datasets or AWS datasets, but these are raw data derived from a single source.

This project is about providing rich data and not only big data!
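
To make "cleansed and integrated" a little more concrete, here is a minimal Python sketch of what such preparation could look like for two tiny sources. Everything in it is hypothetical and for illustration only: the sample records, the field names, and the helper functions `normalize_country`, `cleanse_sales` and `integrate` are assumptions, not part of the project.

```python
# Hypothetical raw extracts from two independent providers (illustrative data only).
sales_by_country = [
    {"country": " france", "sales": "1200"},
    {"country": "GERMANY", "sales": "950"},
    {"country": "Spain ", "sales": "n/a"},  # unusable value, dropped during cleansing
]
population_by_country = [
    {"Country": "France", "population_m": 68},
    {"Country": "Germany", "population_m": 84},
]


def normalize_country(name):
    """Cleansing step: trim whitespace and use a consistent capitalisation."""
    return name.strip().title()


def cleanse_sales(rows):
    """Return {country: sales} keeping only rows whose sales value is an integer."""
    cleansed = {}
    for row in rows:
        try:
            amount = int(row["sales"])
        except ValueError:
            continue  # skip records that cannot be repaired
        cleansed[normalize_country(row["country"])] = amount
    return cleansed


def integrate(sales_rows, population_rows):
    """Integration step: join both sources on the normalized country name."""
    sales = cleanse_sales(sales_rows)
    integrated = []
    for row in population_rows:
        country = normalize_country(row["Country"])
        if country in sales:
            integrated.append({
                "country": country,
                "sales": sales[country],
                "population_m": row["population_m"],
                # Derived measure that neither source could provide alone.
                "sales_per_million": round(sales[country] / row["population_m"], 2),
            })
    return integrated


if __name__ == "__main__":
    for record in integrate(sales_by_country, population_by_country):
        print(record)
```

The derived `sales_per_million` field is the point of the sketch: it only exists once both sources have been cleansed and joined.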

Benefit

Data provider

Designed for anyone interested in capitalizing on their valuable data assets, but put off by the cost associated with data preparation, hosting, operation and maintenance.

The DataPlatform Franchising (or dataPFranc) project takes care of the technical, tedious and specialized tasks involved in implementing and hosting a DataPlatform service, using a shared-revenue model.

Benefits foreseen are:

  • Easily capitalize on your data assets without investing in development and infrastructure
  • Enrich your data assets' value through integration/mashup with other sources
  • Publish datasets under a restricted license to secure collaboration or discover new partnership leads
  • Enhance your own analytics by integrating external sources of data

Data consumer

Data consumers will benefit from:

  • Accessing cleansed and integrated datasets with no initial cost
  • Accessing sophisticated analytical techniques through pre-built visualisation/dashboard tools developed on top of the DataPlatform
  • Reusing their BI tool investment by connecting the tools through standard API access
  • Paying only for what they consume

Technology vendor

On the front-end side, visualisation and dashboarding tools such as QlikView, Tableau and Looker could showcase their capabilities by connecting to Cloud-based DataPlatforms through standard APIs and languages (JDBC, ODBC, Python DB-API, and SQL).

On the back-end side, providers of Cloud-based solutions such as Redshift from Amazon, Azure SQL Data Warehouse from Microsoft and Elastic Data Warehouse from Snowflake could be interested in demonstrating their engines' scalability for much larger public access.
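
As an illustration of the standard access path mentioned above (Python DB-API plus SQL), the sketch below queries a dataset through a DB-API 2.0 connection. It rests on several assumptions: Python's built-in `sqlite3` module stands in for whatever driver a real back-end would supply, and the `integrated_sales` table, its sample rows and the `demo_connection` and `top_countries` helpers are hypothetical.

```python
import sqlite3  # stand-in DB-API 2.0 driver; a real DataPlatform would use its own


def demo_connection():
    """Build an in-memory database holding a small, hypothetical integrated table."""
    conn = sqlite3.connect(":memory:")
    cur = conn.cursor()
    cur.execute("CREATE TABLE integrated_sales (country TEXT, sales INTEGER)")
    cur.executemany(
        "INSERT INTO integrated_sales VALUES (?, ?)",
        [("France", 1200), ("Germany", 950), ("Italy", 700)],
    )
    conn.commit()
    cur.close()
    return conn


def top_countries(conn, limit):
    """Run a plain SQL query through the DB-API cursor interface.

    The "?" placeholder is sqlite3's parameter style; other DB-API
    drivers may use a different one (for example "%s").
    """
    cur = conn.cursor()
    cur.execute(
        "SELECT country, sales FROM integrated_sales "
        "ORDER BY sales DESC LIMIT ?",
        (limit,),
    )
    rows = cur.fetchall()
    cur.close()
    return rows


if __name__ == "__main__":
    connection = demo_connection()
    for country, sales in top_countries(connection, 2):
        print(country, sales)
    connection.close()
```

Because `top_countries` only relies on `cursor()`, `execute()` and `fetchall()`, the same function should work against another DB-API driver once the connection and the placeholder style are adapted.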


  1. The generic term DataPlatform is chosen to avoid debating BI platform architectures. Call it a data warehouse, data mart, data lake, OLAP cube, star schema, data hub, ...: the main focus is on access, not physical design.