funded

Catalyst: Exploratory Data Analysis

$4,000.00 Received
$4,000.00 Requested
Community Review Results (1 reviewer)
Addresses Challenge
Feasibility
Auditability
Problem:

Catalyst already generates a lot of data (voting results, advisor scores, and more) that is not being used to bring value to the process.

Yes Votes:
₳ 190,379,367
No Votes:
₳ 18,602,365
Votes Cast:
1092

This proposal was approved and funded by the Cardano Community via Project F6: Distributed decision making Catalyst funding round.


Detailed Plan


<u>Exploratory Data Analysis</u>

In statistics, Exploratory Data Analysis (EDA) "refers to the critical process of performing initial investigations on data so as to discover patterns, to spot anomalies, to test hypothesis and to check assumptions with the help of summary statistics and graphical representations." ( <https://towardsdatascience.com/exploratory-data-analysis-8fc1cb20fd15> ).

An EDA gives an overview of the main characteristics of the analyzed data and often yields useful insights. It is usually the first step in a data science analysis, before more advanced methods such as machine learning are applied.

<u>The data of Catalyst</u>

Project Catalyst has just completed its first year and 5 full Funds. During this time, a lot of data has been generated, such as:

  • Voting results
  • Proposal data
  • Community Advisors (CA) assessments
  • Veteran Community Advisors (vCA) reviews
  • Challenges budgets
  • Voting Power participating in Catalyst
  • Unique wallets participating in Catalyst

Although all this information is available to the community, it is still not being used to bring value to the process.

<u>Catalyst: Exploratory Data Analysis</u>

An EDA for Catalyst can provide a great deal of information and many insights based on previous Funds, for example:

  • Influence of CA scores on the voting results.
  • Relation between Score, % of Challenge Budget requested and voting results (proposal funded or not, approved or not).
  • Statistics of Funded, Not Funded and Not Approved proposals: average score, minimum score of a Funded proposal in each challenge, maximum score of a Not Funded proposal in each challenge, absolute numbers and percentages of Funded, Not Funded and Not Approved proposals.
  • Number of CA assessments rated as 'Excellent', 'Good' and 'Filtered Out' by vCAs.
  • Variance of CA scores within a single proposal.
  • Analysis of Voting Power vs Unique Wallets, looking for the influence of whales on the results.
  • Alternative voting results: ranking by Unique Wallets instead of Voting Power.
  • Evolution of Catalyst over time: numbers of proposals, active CAs, active vCAs, challenges, voting power and unique wallets.
  • Any further information and analysis that the community might be interested in.
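As a rough illustration of two of the analyses listed above (the variance of CA scores within a proposal, and an alternative ranking by Unique Wallets instead of Voting Power), here is a minimal Python sketch. The records, field names and numbers are all invented for illustration and are not Catalyst data. Ranking by "yes minus no" is an assumption made for this example, not a statement of how Catalyst tallies results.

```python
from statistics import mean, pvariance

# Hypothetical records: ids, field names and values are made up for illustration.
proposals = [
    {"id": "A", "ca_scores": [5, 4, 5],
     "yes_ada": 900, "no_ada": 100, "yes_wallets": 30, "no_wallets": 20},
    {"id": "B", "ca_scores": [3, 5, 1],
     "yes_ada": 400, "no_ada": 50, "yes_wallets": 80, "no_wallets": 10},
    {"id": "C", "ca_scores": [2, 2, 3],
     "yes_ada": 300, "no_ada": 350, "yes_wallets": 25, "no_wallets": 40},
]


def score_summary(proposal):
    """Mean and population variance of the CA scores of one proposal."""
    scores = proposal["ca_scores"]
    return {
        "id": proposal["id"],
        "mean": mean(scores),
        "variance": pvariance(scores),
    }


def rank_by(records, yes_key, no_key):
    """Proposal ids ordered by (yes - no), highest first (assumed metric)."""
    ordered = sorted(records, key=lambda p: p[yes_key] - p[no_key], reverse=True)
    return [p["id"] for p in ordered]


summaries = [score_summary(p) for p in proposals]
by_voting_power = rank_by(proposals, "yes_ada", "no_ada")
by_wallets = rank_by(proposals, "yes_wallets", "no_wallets")

for s in summaries:
    print(s["id"], round(s["mean"], 2), round(s["variance"], 2))
print("Ranked by voting power:", by_voting_power)
print("Ranked by unique wallets:", by_wallets)
```

In this toy data, proposal B has the widest spread of CA scores and moves from second to first place when wallets are counted instead of ada. An EDA report could surface that kind of difference.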

<u>Some examples of these plots and information can already be seen in the attachments of this proposal.</u>

<u>Why submit this proposal under 'Distributed Decision Making'?</u>

Why is this Challenge important? Because 'high-quality and decentralized decision-making will increase treasury ROI and legitimize decentralized governance.' Also, the leading question of this Challenge is 'How can we help the Catalyst community to get better at distributed decision making within the next two Catalyst rounds?' ( <https://cardano.ideascale.com/a/campaign-home/26104> )

This proposal aims to provide a better understanding of how Catalyst works, through information such as what the weak points are, what could be improved, and what proposers should aim for in order to have a higher chance of being funded. All this information, and much more generated by the proposed EDA, will significantly help Catalyst evolve and will strengthen this Distributed Decision Making process in the Cardano ecosystem.

<u>Deliverables and Milestones</u>

Once funded, I will provide EDA Reports for 4 Funds (Funds 4 to 7). These Reports will cover the topics mentioned above, plus any further information and analysis that the community might be interested in. They will include not only the plots but also a comprehensive explanation and discussion of the results.

In alignment with the Challenge goal of supporting Distributed Decision Making in the next two Catalyst rounds, the analysis of Funds 6 and 7 will support the process during Funds 7 and 8, respectively. In addition, recent data from Funds 4 and 5 will help to understand how the Catalyst process has evolved and to support the analysis of its current state.

EDA Reports will be delivered at the end of the following Funds (dates may change according to Catalyst schedule):

  • Reports analyzing Funds 4, 5 and 6: 2021 Q4
  • Report analyzing Fund 7: 2022 Q1

Therefore:

  • After 3 months: 3 EDA reports delivered to the community.
  • After 6 months: 4 EDA reports delivered to the community.

<u>Data Sharing</u>

The raw and treated data, as well as the JupyterLab Notebooks used to generate the charts and plots, will be shared with the community through an open repository on GitHub.

Also, a partnership with the Community Landing Page ( <http://cardanocataly.st/> ) might occur in order to create interactive versions of the charts presented in the EDA reports generated through this proposal.

<u>My Background and Experience</u>

I joined Catalyst during Fund 3, when I served as a CA for the first time, and I have been a CA ever since. I have also been a vCA since Fund 4. I have contributed actively to the project by supporting the creation of community guidelines, as a Proposal Mentor, and more recently as the CAs' representative in the 1st Catalyst Circle ( <https://iohk.io/en/blog/posts/2021/07/08/introducing-the-catalyst-circle> ). I know many aspects of Project Catalyst and can communicate with the community in order to maximize the Return on Intention of this proposal.

Regarding my academic background, I hold a BSc in Chemical Engineering, an MSc in Chemical Engineering and Software Development, and a specialization in Data Science, Machine Learning and Artificial Intelligence. I am also currently a PhD candidate researching Machine Learning applied to Fluid Dynamics.

LinkedIn:
<https://www.linkedin.com/in/victorcorcino/>

<u>Budget Breakdown</u>

For a single report, the following hours are estimated:

  • Data treatment and pre-analysis work: 4h
  • EDA: 6h
  • Comprehensive analysis of results: 6h
  • Report: 4h
  • Total hours for each report: 20h

Considering a total of 4 reports and an hourly cost of $50/h:

  • Total proposal budget: 4 reports x 20h/report x $50/h = $4000
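The arithmetic behind this total can be checked with a few lines of Python. The hours and rate come from the breakdown above. The dictionary keys and variable names are just illustrative labels.

```python
# Hours per task for a single report, taken from the budget breakdown.
hours_per_task = {
    "data_treatment": 4,
    "eda": 6,
    "results_analysis": 6,
    "report_writing": 4,
}

NUM_REPORTS = 4
HOURLY_RATE_USD = 50

hours_per_report = sum(hours_per_task.values())
total_budget_usd = NUM_REPORTS * hours_per_report * HOURLY_RATE_USD

print(hours_per_report)   # 20
print(total_budget_usd)   # 4000
```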


Community Advisor Reviews (1)

Comments

Monthly Reports

The project is delayed by one Fund because of my workload. To compensate for this delay, the results I deliver will cover 5 Funds instead of the 4 specified in the proposal.

Disbursed to Date
$4,000
Status
Still in progress
Completion Target
5/31/2022


As stated in the previous report, the project is delayed by one Fund because of my workload. To compensate for this delay, the results I deliver will cover 5 Funds instead of the 4 specified in the proposal. I'm finishing the report now and expect to release it within the next 1 or 2 weeks.

Disbursed to Date
$4,000
Status
Still in progress
Completion Target
5/31/2022

I had a delay last month because I was sick for almost 3 weeks. The analysis is almost done and I believe I will have the report ready within 2 weeks.

Disbursed to Date
$4,000
Status
Still in progress
Completion Target
6/30/2022
