completed

Dataset - Stake Pool Analytics

$10,080.00 Received
$10,080.00 Requested
Ideascale logo View on ideascale
Community Review Results (1 reviewers)
Addresses Challenge
Feasibility
Auditability
Solution

Open-source code repository for generating an analytics dataset from a mixture of on chain db-sync data and external data sources

Problem:

Full historical stake pool datasets for analysis and machine learning are currently not available through block explorers or existing sites

Yes Votes:
₳ 54,808,984
No Votes:
₳ 15,076,497
Votes Cast:
143

This proposal was approved and funded by the Cardano Community via Project F7: Open Source Developer Ecosystem Catalyst funding round.

Detailed Plan

<u>Problem Summary</u>

There are many opportunities to perform data analysis on pool performance, but a lack of analytics-ready data for the community to access.

<u>Our Solution</u>

Our plan is to build an open-source analytics-ready dataset of Cardano stake pool data which provides the community a consolidated machine-learning ready dataset for the Cardano ecosystem

<u>Data Sources</u>

  • Cardano DB-Sync
  • SMASH Metadata
  • Extended metadata json files

The aim of this dataset will be to enable someone to answer such questions as:

  • What are the best performing stake-pools
  • What is the effect of pool parameter changes to performance over time
  • What are the optimal stake pool parameters to attract stake?
  • What's the rate of growth of stake pools?
  • What's the rate of attrition of stake pools?
  • What is my pledge earning vs my stake pool?

<u>Deliverables</u>

The output of this project would be an open-source Github repository containing data integration scripts and SQL Queries as well as associated schema documentation.

SQL scripts would be separated between DB-Sync only scripts, and extended scripts which require external data sources to be integrated so that developers who are already running DBSync can get maximum value without implementing the data integration components.

Additionally, the ultimate vision is to integrate this dataset into the the Cardano Data Hub (https://cardanodatahub.com/) as one of the available datasets there.

<u>Project Plan and Budget</u>

The project will have 5 components:

Data Discovery / Requirements - $1,800

  • detailed data review and profiling of existing data sources
  • Collaboration with community on requirements

Data Modelling - $1,800

  • Modelling datasets based on available data and community input

Data Integration - $3,000

  • Creating data integration scripts to ingest external datasets

SQL Development - $1,800

  • Develop SQL views to aggregate and implement modelled data

Documentation - $1,200

  • Provide schema and user documentation for consumers and implementers of the data

Quality Assurance / Testing - $480

Total Budget: $10,080

Expected completion date would be 6-8 weeks after funding is received.

<u>Core Team Experience</u>

Michael Stewart

  • 17 years of software development and architecture experience.
  • The last 10 years of focus in the data and analytics space.
  • Led the development team of a boutique data / analytics firm where I designed and architected cloud based data warehouse solutions for fortune 500 companies
  • Member of the Cardano community since 2017
  • Co-Founder of Cardano Canucks stake pool and Canuckz NFTs
  • Co-Founder of CCSPA (Canadian Cardano Stake Pool Association)

Vivek Nankissoor

  • 15+ years of experience in database requirements, design and development
  • Established and grew web analytics, marketing automation and QA practices
  • Engaged in marketing, data and analytics strategy development with enterprise retail, cpg organizations, banks, automotive, pharma, fintech and others
  • Co-Founder of Cardano Canucks stake pool and Canuckz NFTs
  • Co-Founder of CCSPA (Canadian Cardano Stake Pool Association)
  • Participant in community work such as financial literacy relating to crypto and raising awareness with various investment groups

Community Reviews (1)

Comments

Monthly Reports

n/a

Disbursed to Date
$10,080
Status
Still in progress
Completion Target
4/30/2022
Comments 0

Login or Register to leave a comment!

nope!

Disbursed to Date
$10,080
Status
Still in progress
Completion Target
5/15/2022
Comments 0

Login or Register to leave a comment!

The project is now complete.

Disbursed to Date
$10,080
Status
Complete
Completion Target
6/30/2022
Attachment(s)
Comments 0

Login or Register to leave a comment!

Allocated developer had a family emergency - delayed a little but still on track for end date

Disbursed to Date
$10,080
Status
Complete
Completion Target
8/31/2022
Attachment(s)
Comments 0

Login or Register to leave a comment!

close

Playlist

  • EP2: epoch_length

    Authored by: Darlington Kofa

    3m 24s
    Darlington Kofa
  • EP1: 'd' parameter

    Authored by: Darlington Kofa

    4m 3s
    Darlington Kofa
  • EP3: key_deposit

    Authored by: Darlington Kofa

    3m 48s
    Darlington Kofa
  • EP4: epoch_no

    Authored by: Darlington Kofa

    2m 16s
    Darlington Kofa
  • EP5: max_block_size

    Authored by: Darlington Kofa

    3m 14s
    Darlington Kofa
  • EP6: pool_deposit

    Authored by: Darlington Kofa

    3m 19s
    Darlington Kofa
  • EP7: max_tx_size

    Authored by: Darlington Kofa

    4m 59s
    Darlington Kofa
0:00
/
~0:00