Game-Theoretic Mixer
Last updated
Last updated
The Query Mixer is a key part of the Darwin AI blockchain platform, designed to ensure the authenticity and integrity of AI inferences across distributed networks. It tackles inference validation with Probabilistic Query Mixing, blending real user queries with special challenge queries to ensure nodes process requests honestly. Using our proprietary LLM instructional fingerprint, the system outputs consistent results for specific queries, leveraging a game-theoretic approach to prevent cheating and guarantee honest behavior by inference nodes.
Real-Time Integrity Assurance: The Query Mixer operates in real-time, ensuring that AI inferences across distributed networks are validated without delay. By combining mixing with slashing, the system verifies the authenticity of each inference, guaranteeing swift and accurate responses.
Collusion Prevention with Minimal Overhead: The system's advanced Probabilistic Query Mixing technique reroutes queries through multiple nodes and adds mixing layers, all while maintaining minimal overhead. Each step is transparently recorded on-chain, deterring nodes from dishonest behavior because they risk losing their staked crypto assets.
Fastest Verifiable AI Compute: Leveraging our proprietary LLM instructional fingerprint, the Query Mixer delivers the fastest verifiable AI compute, including inference and fine-tuning.
Pre-computation: Special challenge queries are created using known responses to form a deterministic model fingerprint.
Query Submission: Users send their queries to the Mixer, which initiates the mixing process.
Query Distribution:
Queries, including user and challenge queries, circulate within the Mixer network for node signatures.
These fingerprints are used as challenges within the query batches sent to inference nodes, serving as benchmarks to validate the node's output.
Probabilistic Mixing: Real user queries are mixed with challenge queries, creating batches where the origin of each query (user or challenge) is indiscernible to the inference nodes.
Mixer nodes combine user and challenge queries and forward the batch to the Inference Node.
Model Execution: The Inference Node processes all queries in the batch, treating them as indistinguishable.
Verification: Mixers verify responses to challenge queries, recording the query paths on the blockchain for transparency and ensuring nodes' honest performance.
This section explains calculating the probability of a node passing a series of challenges within an epoch using a Poisson distribution. Given a fixed challenge emission rate, it factors in challenge difficulty and the threshold for node slashing.
The number of challenges a node receives in an epoch is modeled using a Poisson distribution:
Where:
Where:
The fixed challenge emission rate.
The probability that a cheating node passes a single challenge.
The threshold number of failed challenges after which a node will be slashed.
The number of challenges received.
The number of failed challenges.
The overall probability that a node passes the series of challenges within the epoch.
Note:
This calculation is essential for determining the reliability and trustworthiness of nodes in a decentralized network.
Challenge queries are crafted using instructional fingerprinting, which involves fine-tuning a Language Model (LLM) on a meticulously curated dataset of instruction pairs .
The model is engineered to respond to a query with a predefined answer , creating a set of query-answer pairs that serve as the model’s fingerprints.
is the fixed challenge emission rate, representing the average number of challenges a node receives in an epoch.
is the number of challenges.
The probability that a cheating node passes a single challenge is denoted by . For example, for fingerprinting, this probability will be close to 0.
The threshold is the number of failed challenges after which a node will be slashed.
The probability that a node passes the challenges is computed using the following formula:
is the probability that the node receives fewer than challenges.
is the probability that the node receives or more challenges but passes enough of them to avoid being slashed.
For a given , , and :
This formula can be implemented in a programming environment to compute the exact probability .
Accurate computation of helps in setting appropriate thresholds and emission rates to maintain network integrity.