Fediverse as of 2018-08-24.
7 788 instances with 1 303 382 connections.
Let's look at the different clusters. They aren't clearly separated, but still visible.
Searchable (ctrl+f) SVG without lines to keep the filesize reasonable: https://lucahammer.at/vis/fediverse/2018-08-24-fediverse-min.svg
If you compare it with the screenshots, keep in mind that it is mirrored.
Beware of over interpretation. This is no scientific work. It's just one way to visualize instances. There are many others and probably better ones.
Qualifying the connections by amount of follow connections could draw a different image.
In the end it's not important for your everyday use. You can follow every account of every instance (as long as your instance doesn't cut ties with the other one).
Most important are the people you follow and what you toot.
I tried to clean up the script, but it's still messy. Feedback welcome: https://gist.github.com/lucahammer/eb68fc43a34877a304e7b81e5f729c4b
Data as GDF from yesterdays visualization (zipped, 10MB): https://lucahammer.at/vis/fediverse/2018-08-24-fediverse-GDF.zip
@Luca People, there's work to do! We're in the middle of the "niche instances" with a "non-technical focus"... 😟
@hiemstra It's easily possible that my quick interpretations are wrong. Niche is a rather broad term. And I would say it's a good thing.
@Luca I think it's pretty accurate (but we're supposed to be a technical university, so...)
Nice work, BTW!
@Luca How did you differentiate between the violet cluster and the others? I can't believe all the instances in the orange cluster are tech focused for example.
@Luca Cool. Now can anyone tell us what they're each about? It's really hard to find an instance that matches individual interests!
@darwinwoodka I didn't discover topical clusters and I believe it doesn't matter that much which instance you choose. I believe trust into the admins is more important than the theme.
@Luca have you ever ran a plot of instance age (if you save that data) vs nb connections (maybe relative nb connections)? curious to see how they‘re related
@Luca Oh I misunderstood, heh. Wonder why we got pushed to the right a bit. Might be cause I sometimes speak French.
@scarlett The SVG is searchable (Ctrl+f). https://lucahammer.at/vis/fediverse/2018-08-24-fediverse-min.svg
@Luca wow, I can actually see my self-run single-user instance from here! I feel, quite literally, extremely seen. 😁
@fuzzface @Cdespinosa I created a SVG that can be search through (ctrl+f): https://lucahammer.at/vis/fediverse/2018-08-24-fediverse-min.svg (Sorry for flipped axis)
@maloki Thanks for reminding me.
The images show names of instances that are connected through lines. They are one big hairball with two slightly visible groups of instances.
Hat der 'Ball' oder wie man dieses Ding nennen möchte, wirklich leichte Ähnlichkeiten mit dem Icon von GoogleChrome? 🧐
Find die Form interessant. Alles schön Rund. Nicht so eckig und Kantig wie die von Twitter. 🤣
I admit I'm not too familiar with Gephi, but your description sounds like a "regular" force directed graph. If that already produces this, I'm curious what tSNE or umap would produce!
I bet even just feeding a 7788x7788 matrix with binary values based on whether they are connected or not would work well, it's kind of crazy how much structure those two algorithms can reveal!
@vanderZwan Which tools / libraries would you recommend? Preferably Python.
I will clean up my code and share the collection script as well as the collected data. Maybe you can use it.
Well, there is only one real umap implementation at the moment, by Leland McInnes, who the guy who developed the algorithm. Luckily its in Python :)
Here he gives a talk explaining how it works in accessible terms at SciPy 2018:
@le_ArthurDent @vanderZwan Just published the script and data: https://vis.social/web/statuses/100610249989818510
@Luca @vanderZwan It appears that I have a hard time trying to wrangle this up into a matrix of connections, especially since I can't import gdf files directly into R. What is the node list telling me? Edge data seems to be the metadata of the instances.... but still there arent as many levels in the edge list as there are levels of nodes...
@le_ArthurDent @vanderZwan Yes, nodes is simple the data the instances provide about themselves. Only mastodon instances at the moment. Edges are what the Mastodon instances report as their peers. For each peer there is an edge (source, target).
If you tell me in what format you need the data, I could try to adapt the script.
I guess I got it. I was just wondering why the number of different instances in node1 varies from those in node2. How many instances are there? I assumed that there are 7788 instances having connection. So if I got the data right you have here a principal component analysis and a t-SNE of the connections. Colours are groupings obtained by k-means clustering, boldly guessing 10 clusters. I have too look further to map this for a better inspection of the instance names...
@Luca @vanderZwan I made some huuuuge plots with instance names for a visual insepction of the clustering. Just look how perfectly pixelfed.social stands out at the bottom of the scores plot. And in the t-SNE plot you see on the far right peertube and its associated servers *_*.
(I hope this is still readable after uploading it xD"...)
Wow looks interesting, can You toot me an URL, that I can see it later on a bigger screen. On my 5 inch Xperia can't see so much ;)
vis.social is an open social platform for creative people, especially anyone in SciArt, SciComm, data, visualization, creative coding, and related arts and research. English is the common language of the instance.