30 sats \ 0 replies \ @zuspotirko OP 5 May \ parent \ on: I Made a Graph of Wikipedia... This Is What I Found BooksAndArticles
I don't know how long it took him but I have worked with similar datasets (60 million wikipedia pages, multiples of that in links) and I would say that's the sort of thing that runs overnight on modern hardware. Maybe if he has a cluster or powerful server with Intel Xeons or AMD Threadrippers you can work almost normally with it - imagine drinking a coffee during runs. Hard to say.