Dataset - Stake Pool Analytics
Current Project Status: unfunded
Total amount received: $5,040
Total amount requested: $10,080
Percentage received: 50%
Solution
An open-source code repository for generating an analytics dataset from a mixture of on-chain DB-Sync data and external data sources.
Problem
Full historical stake pool datasets for analysis and machine learning are not currently available through block explorers or existing sites.

Detailed Plan

<u>Problem Summary</u>

There are many opportunities to analyze stake pool performance, but the community lacks access to analytics-ready data.

<u>Our Solution</u>

Our plan is to build an open-source, analytics-ready dataset of Cardano stake pool data, giving the community a consolidated, machine-learning-ready dataset for the Cardano ecosystem.

<u>Data Sources</u>

  • Cardano DB-Sync
  • SMASH metadata
  • Extended metadata JSON files

The aim of this dataset is to enable users to answer questions such as:

  • What are the best-performing stake pools?
  • What is the effect of pool parameter changes on performance over time?
  • What are the optimal stake pool parameters to attract stake?
  • What is the rate of growth of stake pools?
  • What is the rate of attrition of stake pools?
  • What is my pledge earning versus my stake pool?
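As an illustration of the growth and attrition questions above, the following is a minimal sketch, not part of the proposed deliverables. It assumes the dataset can supply one epoch number per pool registration and one per pool retirement; the function name and the sample numbers are hypothetical and do not come from real chain data.

```python
from collections import Counter


def pool_growth_and_attrition(registrations, retirements, first_epoch, last_epoch):
    """Return rows of (epoch, active_at_start, new, retired, growth_rate, attrition_rate).

    registrations / retirements: iterables of epoch numbers, one entry per event.
    Events outside [first_epoch, last_epoch] are ignored.
    Rates are relative to the pools active at the start of the epoch and are
    None while no pool is active yet.
    """
    new_by_epoch = Counter(registrations)
    retired_by_epoch = Counter(retirements)
    active = 0
    rows = []
    for epoch in range(first_epoch, last_epoch + 1):
        new = new_by_epoch[epoch]
        retired = retired_by_epoch[epoch]
        growth = new / active if active else None
        attrition = retired / active if active else None
        rows.append((epoch, active, new, retired, growth, attrition))
        # Carry the active pool count forward to the next epoch.
        active += new - retired
    return rows


# Hypothetical sample events (illustrative only).
sample_registrations = [210, 210, 210, 210, 211, 211, 212]
sample_retirements = [211, 212, 212]

rows = pool_growth_and_attrition(sample_registrations, sample_retirements, 210, 212)
for row in rows:
    print(row)
```

Measuring both rates against the pools active at the start of each epoch is only one possible convention; a real analysis might prefer a different denominator or a rolling window.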

<u>Deliverables</u>

The output of this project would be an open-source GitHub repository containing data integration scripts and SQL queries, as well as associated schema documentation.

SQL scripts would be split into DB-Sync-only scripts and extended scripts that require external data sources to be integrated, so that developers who are already running DB-Sync can get maximum value without implementing the data integration components.
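The two-tier split described above could look roughly like the sketch below. It uses an in-memory SQLite database purely so the example is self-contained; DB-Sync itself runs on PostgreSQL, and every table, view, and column name here is a hypothetical stand-in rather than the actual DB-Sync schema or the project's final model.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
-- Hypothetical stand-in for data derived from DB-Sync alone.
CREATE TABLE pool_epoch_onchain (
    pool_id TEXT, epoch_no INTEGER, active_stake INTEGER, blocks_minted INTEGER
);
-- Hypothetical stand-in for ingested external metadata (e.g. from SMASH).
CREATE TABLE pool_metadata_ext (pool_id TEXT PRIMARY KEY, ticker TEXT, name TEXT);

-- Tier 1: view that needs only on-chain data.
CREATE VIEW v_pool_epoch AS
SELECT pool_id, epoch_no, active_stake, blocks_minted FROM pool_epoch_onchain;

-- Tier 2: extended view; the LEFT JOIN keeps pools that have no metadata.
CREATE VIEW v_pool_epoch_extended AS
SELECT p.*, m.ticker, m.name
FROM v_pool_epoch AS p LEFT JOIN pool_metadata_ext AS m USING (pool_id);
""")

# Illustrative rows only; not real chain data.
conn.executemany(
    "INSERT INTO pool_epoch_onchain VALUES (?, ?, ?, ?)",
    [("pool_a", 300, 1000, 3), ("pool_b", 300, 500, 1)],
)
conn.execute("INSERT INTO pool_metadata_ext VALUES ('pool_a', 'AAA', 'Pool A')")

extended_rows = conn.execute(
    "SELECT pool_id, epoch_no, ticker FROM v_pool_epoch_extended ORDER BY pool_id"
).fetchall()
print(extended_rows)
```

Under this layout, someone running only DB-Sync could use the first view unchanged, while the extended view becomes useful once the external metadata table has been loaded.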

Additionally, if the Cardano Data Hub idea (<https://cardano.ideascale.com/a/dtd/Cardano-Analytics-Data-Hub/368258-48088>) gets funded, this dataset will be integrated as one of the datasets available there.

<u>Project Plan and Budget</u>

The project will have 6 components:

Data Discovery / Requirements - $1,800

  • Detailed data review and profiling of existing data sources
  • Collaboration with the community on requirements

Data Modelling - $1,800

  • Modelling datasets based on available data and community input

Data Integration - $3,000

  • Creating data integration scripts to ingest external datasets

SQL Development - $1,800

  • Develop SQL views to aggregate and implement modelled data

Documentation - $1,200

  • Provide schema and user documentation for consumers and implementers of the data

Quality Assurance / Testing - $480

Total Budget: $10,080

Expected completion is 6-8 weeks after funding is received.

<u>Core Team Experience</u>

Michael Stewart

  • 17 years of software development and architecture experience
  • Focused on the data and analytics space for the last 10 years
  • Led the development team of a boutique data / analytics firm where I designed and architected cloud-based data warehouse solutions for Fortune 500 companies
  • Member of the Cardano community since 2017
  • Co-Founder of Cardano Canucks stake pool and Canuckz NFTs
  • Co-Founder of CCSPA (Canadian Cardano Stake Pool Association)

Vivek Nankissoor

  • 15+ years of experience in database requirements, design and development
  • Established and grew web analytics, marketing automation and QA practices
  • Engaged in marketing, data and analytics strategy development with enterprise retail and CPG organizations, banks, and automotive, pharma, fintech and other companies
  • Co-Founder of Cardano Canucks stake pool and Canuckz NFTs
  • Co-Founder of CCSPA (Canadian Cardano Stake Pool Association)
  • Participant in community work such as financial literacy relating to crypto and raising awareness with various investment groups
