Random Number Generators in Rel
We are excited to share that we have once again expanded the functionality of our Relational Knowledge Graph Management System (RKGMS). We have implemented multiple pseudorandom number generators (PRNGs) that can now be used to build probabilistic models in Rel. Specifically, three general-purpose PRNGs are currently available via our Rel Standard Library:
Mersenne Twister
Mersenne Twister is one of the most widely used PRNGs. In Rel, the relation random_mersenne_twister
generates a floating-point random number (between 0.0
and 1.0
) for every tuple in a relation:
random_mersenne_twister[42, {(1, "January"); (2, "February")}]
def output = random_mersenne_twister[42, {(1, "January"); (2, "February")}]
Providing the seed (here: 42
) ensures that the Rel model is deterministic and reproducible—meaning it always re-evaluates to the same pseudorandom number.
As well as numbers, DateTime
data can also be used as seeds.
random_mersenne_twister[datetime_now, range[1, 5, 1]]
random_mersenne_twister[datetime_now, range[1, 5, 1]]
The relation range
can be used to conveniently create five random numbers.
Threefry
Threefry is a simplified Threefish algorithm which uses a UInt64 state space.
The relations random_threefry_uint64
and random_threefry_float64
can be used to create random UInt64
and Float64
numbers.
def output:uint64 = random_threefry_uint64[1234, 3]
def output:float64 = random_threefry_float64[1234, 3]
The random floating-point numbers lie between 0.0
and 1.0
, whereas the random UInt64
numbers are distributed over the entire UInt64
value range.
In contrast to the Mersenne Twister implementation, the Threefry implementation is first-order, meaning it expects individual Int64
or UInt64
values rather than relations with arity greater than 1
.
It is even possible to create independent random sequences, where the seed itself is randomly generated.
def seed = random_threefry_uint64[1234, 0]
def output = random_threefry_float64[seed, i]
for i in range[1, 3, 1]
Threefry falls in the class of counter-based random number generators which are well-suited for generating random numbers in a highly parallelized fashion using multiple CPU threads or GPUs.
Random Device
The relations random_unit64
and random_uint128
use a hardware-based random device provided by the system to generate random numbers.
def output:uint64 = random_uint64
def output:uint128 = random_uint128
No seed is needed because the randomness is hardware-based.
Due to the nature of the random device, the generated random numbers are not reproducible and change for every transaction. This can have huge performance impact and negative consequences for incremental view maintenance as values change every time this relation is re-evaluated.
In contrast to the Mersenne Twister and Threefry PRNGs, the relations random_unit64
and random_uint128
produce only single random numbers and cannot produce sequences of random numbers in Rel.
The random number from random_unit64
can also be used as a seed for Mersenne Twister and Threefry, chaining different PRNGs together.
def data = {1; 2}
def my_numbers:MT =
random_mersenne_twister[random_uint64, data]
def my_numbers:threefry =
random_threefry_float64[random_uint64, i]
for i in data
def output = table[my_numbers]
What's Next?
If you have a favorite PRNG that you would like to see in Rel, please let us know.
Stay tuned as we are developing a uniform interface which can be used across all our pseudorandom number generators.