Drug discovery is traditionally slow, expensive, and risky. Conventional pipelines can take over a decade and billions of dollars to bring a single drug to market [4]. Challenges include:
NOVA addresses these challenges by leveraging the decentralized power of the Bittensor network [5]. By incentivizing the use of ML-based active learning and heuristic adaptive search methods, NOVA transforms drug discovery into a scalable, efficient search process. Every participant—miner or validator—plays a role in rapidly screening a billion-sized molecular library, optimizing the search for high-affinity, synthesizable drug candidates.
NOVA is one of several specialized subnets within the Bittensor network. While other subnets focus on tasks such as pretraining (SN9), data collection (SN13), and protein folding (SN25), NOVA is dedicated to early-stage drug discovery.
Objective: Find the best molecule in the shortest amount of time. NOVA V1 challenges miners to find a molecule from the database provided with the highest binding affinity to the protein target selected for each challenge. Over time, the challenge will evolve into a multi-parametric optimization exercise that mirrors the process of drug discovery with multiple targets and key physicochemical properties that maximize the likelihood of eventual drug approval.
Its architecture comprises three major components:
Mining Process:
Miners extract candidate molecules from SAVI 2020. They refine their
strategies over time by using adaptive search techniques (e.g., substructure
searches, heuristic filters) or, when applicable, ML-based active learning
methods that are adjusted based on the Deterministic Oracle predictions [2,
3]. (See Appendix A for Miners Concept Logic).
Validation Process:
Validators score the submissions for block n at the end of block n+1 using
PSICHIC, ensuring objective and transparent evaluation. (See Appendix B for
Miners Concept Logic).
Reward Allocation:
At the end of each challenge, the miner that has presented the molecule with
the highest score (binding affinity to the protein) will receive the reward.
(See Appendix C for more on the reward allocation rationale).
PSICHIC is an advanced prediction model that estimates protein–ligand binding affinity using minimal input—typically the protein sequence and the molecule's representation (such as SMILES) [2]. PSICHIC's reproducible, deterministic output predictions provide a clear, objective score, that combined with its high customizability and open-source nature, positions it as the ideal candidate as the first Deterministic Oracle.
Deterministic Evaluation:
PSICHIC's fixed output for any given molecule (given a fixed model version)
ensures that all submissions are scored objectively.
Motivating Optimized Search:
Because the SAVI 2020 database is so large, it's not practical to evaluate
every molecule by brute force in the defined time for each challenge. This
forces miners to develop intelligent strategies—whether through ML-based
active learning or adaptive search using heuristic methods—to hone in on the
best candidates to be evaluated through PSICHIC predictions.
Standard for Success:
In each challenge, the candidate's PSICHIC score is the definitive metric for
success, making it the cornerstone of the competitive process.
The SAVI 2020 database is an ultra-large, synthesizable virtual library generated by applying expert-curated reaction rules to known chemical building blocks [1]. This process yields approximately 1.75 billion compounds that are:
Synthesizable:
Each molecule comes with a clear synthetic route, ensuring that computational
hits can be readily synthesized and tested in the lab.
Chemically Realistic:
SAVI's compounds are grounded in established chemical reactions, ensuring they
are practical for real-world drug development, reducing the odds of spending
time evaluating inaccessible chemical matter while also aiming for
exceptionally high chemical diversity.
Scalability:
SAVI 2020 vastly expands the searchable chemical space beyond what traditional
datasets offer.
Practicality:
Because every candidate was previously evaluated in terms of synthesizability,
any hit identified by miners is actionable, enabling a swift transition from
virtual screening to experimental validation.
Democratization:
As an open-source resource, SAVI aligns with NOVA's decentralized ethos,
providing all participants with the same high-quality foundation for
innovation.
NOVA transforms drug discovery into a well-defined optimization challenge—finding the molecule with the highest binding affinity to a target protein. Instead of a random trial-and-error approach, the process is structured as follows:
Optimization Objective:
Maximize the binding affinity score provided by PSICHIC [2].
Iterative Refinement:
Miners use feedback from PSICHIC to refine their search strategy. This
iterative process can be driven by ML-based active learning—as demonstrated in
Traversing Chemical Space with Active Deep Learning [7]—or by adaptive search
using creative search heuristics.
Heuristic-Driven Exploration:
Even without complex ML models, miners can use creative search heuristics—such
as prioritizing molecules with known active substructures performing well
under the Deterministis Oracle evaluation—to focus on the most promising
regions of the chemical space. Approaches like PyrMD: Accelerated Chemical
Space Exploration Using Active Sampling [8] and MolPAL: An Active Learning
Framework for Molecular Property Prediction [10] provide compelling examples
of how targeted sampling and iterative feedback can dramatically reduce
computational costs and enhance screening efficiency.
This approach is similar to methods used in neural architecture search or reinforcement learning, where systems iteratively improve based on feedback. By combining active learning and adaptive search, NOVA efficiently narrows down a billion-molecule space to the most promising candidates.
Per-Block Submissions:
Miners submit candidate molecules selected from SAVI 2020, with the caveat
that each submission overwrites the previous one. This will ensure that miners
only submit a new molecule when they find a higher affinity score.
Challenge Rounds:
Over the challenge period (360 blocks), miners continuously refine their
submissions using their chosen search strategies. Rewards will be calculated
to determine the highest scoring miner, with lower reward values in the
beginning of the challenge and larger values towards the end of the challenge.
Winner-Takes-All Reward:
At the end of each block, validators evaluate each miner's active submission
with PSICHIC [2] and rank miners based on their scores. The miner with the
highest binding affinity score wins the reward. If the same molecule is
submitted by two different miners or two molecules with the same binding
affinity were submitted by different miners, the miner that submitted it first
wins the sub-challenge.
Intelligent Strategy Requirement:
With only one active submission allowed at a time and a winner-takes-all
reward, miners must also focus on quality, not only quantity.
Continuous Improvement:
The competitive nature drives miners to refine their strategies iteratively,
using feedback from PSICHIC to guide enhancements.
Transparency and Fairness:
Every submission is evaluated objectively, ensuring a level playing field for
all participants.
Miner Submissions:
Each miner can submit up to one candidate molecule at any given time. This
means that miners are incentivized to make another submission only when they
find a better candidate.
PSICHIC Evaluation:
At the end of each block (~12 seconds), validators perform an evaluation of
the binding affinity score of each miner's active submission. Winner molecules
and their binding affinities are presented to the community on the
leaderboard, further refining the community's search strategies and
incentivizing competition.
Winner Determination:
Every 360 blocks, validators define the best miner based on the best score.
Ties are solved by prioritizing who submitted the best performing molecule
earlier. The candidate with the highest PSICHIC score receives the TAO reward
for that chunk, which is expected to be lower in the beginning of the
challenge and higher towards the end, incentivizing continuous participation.
Objective, Fixed Scoring:
By using PSICHIC as the Deterministic Oracle we ensure the elimination of
subjective bias in the definition of winner molecules given its deterministic
predictions.
Spam/Attacks Prevention:
Limiting submissions to one active submission at any given time incentivizes
miners to focus on quality rather than indiscriminate volume of submissions.
Miner
Validator
Network:
Multi-Parameter Validation:
Incorporate multi-target predictions and additional relevant metrics (e.g.,
drug-likeness, ADME/Tox, synthetic feasibility) to ensure the best candidates
are viable.
Reputation and Staking:
Implement validator reputation systems and staking for miners and validators
to further secure the process.
Enhanced Feedback Loops:
Use continuous validation data to refine PSICHIC, providing target-specific
fine-tuned versions for specific challenges.
NOVA leverages alpha TAO tokens to reward innovation and efficiency:
Winner-Takes-All:
The miner with the highest PSICHIC score at the end of each challenge wins the
alpha TAO reward for that chunk. Reward values will grow as the challenge
progresses, incentivizing continuous participation. And aligning the
incentives with rewarding exceptional search mechanisms.
Performance-Based Scaling:
Consistently high-performing miners build reputations, unlocking higher-value
challenges over time.
Validator Rewards:
Validators earn fees for fast, reliable, accurate and objective evaluation
using the Deterministic Oracle, reinforcing honest processing.
Alignment of Interests:
Miners and validators are incentivized to act in ways that enhance the overall
quality and scientific output of the network, maximizing the potential of IP
generation.
Sustainable Growth:
As more participants join, cumulative contributions drive both scientific and
economic progress.
Collective Knowledge:
Each round enriches a shared repository of high-potential drug candidates,
further accelerating drug discovery.
Increased Participation:
More miners and validators mean greater collective computational power,
enabling a deeper exploration of chemical space.
Cumulative Data Enrichment:
Each validated candidate adds to the network's repository of molecules–target
interactions, refining future search and prediction strategies.
Interoperability with Other Subnets:
NOVA challenges hold potential to benefit from integration with other
Bittensor subnets (e.g. SN9, SN25, SN13), creating a synergistic ecosystem
that amplifies innovation.
ADME/Tox Integration:
Expand the validation process to include additional metrics such as
pharmacokinetics and toxicity. A drug may have a high binding affinity to a
desired protein target but lack other properties in order to become a
successful therapeutic.
Wet lab validation:
Top miner submissions will be compiled in tokenized libraries, synthesized,
and tested in wet labs. Results from in vitro and in vivo assays will be used
to further the drug development process and used to fine tune the oracle
model; thus, improving the predictive power of the subnet.
Community-Driven Innovation:
Foster open-source collaboration to continuously refine miner algorithms and
share best practices.
NOVA is a transformative platform that leverages decentralized computing, intelligent search techniques, and a robust, transparent validation process to revolutionize drug discovery. By reimagining drug discovery as an optimization problem and harnessing a billion-molecule library of synthesizable compounds, NOVA creates a dynamic ecosystem where every miner submission and every 360 blocks (~1-hour) challenge round pushes the frontier of therapeutic innovation.
Through the integration of PSICHIC as a deterministic oracle [2], the strategic use of the SAVI 2020 database [1], and a competitive mining model supported by a solid TAO token incentive structure, NOVA ensures that only the most promising drug candidates move forward. This approach not only accelerates the discovery process but also bridges the gap between computational prediction and real-world experimental validation.
For the blockchain community, NOVA is a powerful demonstration of how decentralized networks and tokenized incentives can drive meaningful scientific breakthroughs. For drug developers, it represents a scalable, efficient, and transparent pathway toward discovering the next generation of therapeutics.