How much processing power is required to run these kinds of algorithms?
I don't know how long it took him, but I have worked with similar datasets (60 million Wikipedia pages, and several times that many links), and I would say that's the sort of job that runs overnight on modern hardware. If he has a cluster or a powerful server with Intel Xeons or AMD Threadrippers, he could maybe work with it almost normally, with each run taking about as long as drinking a coffee. Hard to say.
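
For a rough feel of the scale, here is a back-of-the-envelope sketch in Python. Everything in it apart from the 60 million pages is an assumption of mine: 5 links per page as a stand-in for "multiples of that", 4-byte integer page IDs, 50 passes over the link list, and two made-up throughput figures for a slow and a fast setup. I don't know which algorithm he actually ran, so treat the outputs as orders of magnitude, not measurements.

```python
def estimate(pages, links_per_page, edges_per_sec, passes=50, bytes_per_id=4):
    """Rough cost of an algorithm that scans every link once per pass.

    All parameters except `pages` are illustrative assumptions.
    Returns (number of links, edge list size in GB, runtime in minutes).
    """
    links = pages * links_per_page
    # Each link is stored as a (source, target) pair of integer IDs.
    edge_list_gb = links * 2 * bytes_per_id / 1e9
    minutes = passes * links / edges_per_sec / 60
    return links, edge_list_gb, minutes


if __name__ == "__main__":
    # Hypothetical throughputs: a slow, disk-bound or interpreted setup
    # versus a fast in-memory one. Neither is a benchmark result.
    for label, rate in [("slow setup", 1_000_000), ("fast setup", 50_000_000)]:
        links, gb, minutes = estimate(60_000_000, 5, edges_per_sec=rate)
        print(f"{label}: {links:,} links, {gb:.1f} GB edge list, ~{minutes:.0f} min")
```

Under those assumptions the link list itself is only a couple of GB, and the gap between the two throughput guesses is the gap between a run of several hours and a coffee break. Parsing the raw dump and any heavier per-link work would come on top of that.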