Gephi – The Open Graph Viz Platform

uniqueuid · on April 6, 2022

Gephi is great for exploratory work, but I’ve seen it lure people into using methods that they don’t fully understand. The same problem that many statistics apps face.

IMO, a better approach to proper network analysis is to use a library such as the excellent igraph in R or Python together with a clear understanding of the measures.

There is a wonderful book called „network analysis literacy“ by Katharina Zweig which really helps with the latter.

usgroup · on April 6, 2022

Every once in a while I have a network related data science type problem to muddle through and I invariably want to try visualise some aspect of it, so I invariably try use Gephi for it, and it invariably leads to a frustrating experience for anything non trivial, and so I invariably end up doing it in code + graphviz instead.

uniqueuid · on April 6, 2022

To be fair, a large part of the problem is that many graph-related algorithms are simply very expensive (computationally). For example, many layouts (especially classical ones such as Fruchterman-Rheingold) are very slow. That often makes visualization frustrating when you have more than a couple of hundred nodes and/or a dense network. And we should acknowledge that the Gephi people have put a lot of work into making it work.

So network analysis evidently benefits from think-then-do approaches, while exploratory work is really hard.

uniqueuid · on April 6, 2022

And here is the book link: https://link.springer.com/book/10.1007/978-3-7091-0741-6

Simon_O_Rourke · on April 6, 2022

I use Gephi a lot with work - it's helped explain lots of network complexity to leadership teams. They don't typically understand when you talk about risks of a network partition or node failure, but you visualize it and suddenly it's much more apparent. The circle layout, for me, works best with larger graphs.

My only quibble would be a better graph search function and highlighting.

jacomyma · on April 6, 2022

Point well taken.

t43562 · on April 6, 2022

I love it because it handled huge dependency graphs from Symbian and Yocto builds which showed the brutal level of interconnections and where the really big dependency magnets were (GUI toolkits like Qt depend on almost everything). Nothing else that I could find really showed that. It was a bit tricky to use sometimes but revolutionary in terms of giving me a gut understanding of the problems I was facing.

wirthjason · on April 6, 2022

I’m a huge fan of Cytoscape.js. Not sure if it would be a competitor to Gephi as it’s just a JavaScript library but it’s very useful for things one might use D3 for. Not too not does it have the ability to draw, style, and animate the networks it has all the graph algorithms to do the analysis and traversal.

https://js.cytoscape.org/

ngcc_hk · on April 6, 2022

Is this once again another self contained one with plug-in architecture?

Or can it call from external like lisp-stat (sorry if not really apologetic) …

And given it is seem step into my current search for social simulation env, how can this visionisation give itself to statistical analysis result and then move into some parameters (programming) so we can simulate, collect, trial some strategy (both manual or limited old style programming like lisp or even go for new era AI which look at output pattern/input/policy …

Or more side track a bit but perhaps also important run or at least control under exploratory Jupiter-lab notebook. If not programming at least as documentation and testing / demo to fellow researcher or just student.

uniqueuid · on April 6, 2022

To do that you want igraph or networkx in R or Python. To visualise, use the ggraph package [1]

[1] https://www.data-imaginist.com/2017/ggraph-introduction-layo...

m3047 · on April 6, 2022

I've been thinking that Gephi is getting long in the tooth. Has anybody tried Cytoscape? (https://cytoscape.org/) (DNS is SERVFAILing at the moment.)

I use it for a combination of "no K" clustering (general exploration) and what's referred to in threat intelligence by the term of art "pivoting".

ta988 · on April 7, 2022

Everytime I try Gephi I tend to go back to Cytoscape (desktop not JS). I need to see how it evolved it has been a couple of years now.

m3nu · on April 6, 2022

Works well and results look good. There wasn't a release for a few years, so be sure to use the nightly version.

mbastian · on April 6, 2022

The 0.9.3 version has just come out yesterday, announcement to come on the blog. https://github.com/gephi/gephi/releases/tag/v0.9.3

It was in pre-release for a few weeks to address any regressions.

m3nu · on April 6, 2022

Oh. Lucky coincidence. Great to have an official release again.

eesmith · on April 6, 2022

https://github.com/gephi/gephi/releases says 0.9.3 was released 21 days ago.

The previous, 0.9.2, was indeed in 2017.

dpoljak · on April 6, 2022

Is there anything similarly capable for web? From what I've experienced Gephi is still the go to solution at many places?

jacomyma · on April 6, 2022

As a joke answer, check Gephisto: https://jacomyma.github.io/gephisto/ (it's actually a design experiment, but I'll say no more).

dpoljak · on April 6, 2022

It definitely fulfilled my expectations for the joke part :)) Regardless, there is quite a bit of source available in the git repo so it shows it can be done even though it chugs quite a bit.

lmeyerov · on April 6, 2022

Yes! :)

Our visual graph AI tool includes a GPU-accelerated take on gephi's flows and puts on the web, including a free GPU-accelerated tier with no-code UIs, embedding & control APIs (python, js, react, arrow), and deep pydata integration (Jupyter, RAPIDS, dashboarding like databricks & streamlit, ...): www.graphistry.com/get-started .

It's used a lot by folks doing fraud, IT, social, security, supply chains, anti-misinfo, finance, bio, etc. Mostly data scientists today, and as we have been launching no-code & low-code features, a diverse broader analyst community has been growing, who has been inspiring.

Gephi got a bit frozen in time due to the usual problem of struggling for post-phd sustainability by not building it in: I'm a big fan of the founders and their work, and just like Graphviz (att research canceled it), it was painful watching them having to leave something so cool. We prioritized sustainability as an engine for reliable & growable OSS, which has worked (ex: you may have heard of Apache Arrow, which we helped kick off). So our free SaaS tier aims to include everything in Gephi, and a lot missing in it for modern use: GPU accel, DB connectors & visual playbooks (already in self-hosted), visual graph ETL, and launching a bunch of graph AI stuff (entity linking, event scoring, recommendations, ... by automating UMAP and graph neural network flows). Likewise, we have + are steadily launching things not in Gephi yet you'd expect of modern team+enterprise tools like sharing, RBAC, SSO, daily-scanned docker/k8s/AMIs, etc. We are aiming for a model basically somewhere between gitlab and GitHub, and as we hit more sustainability, keep biasing for more free & OSS.

The good news is, years later.. it worked! We have reached sustainable growth & measurably best-in-class performance, so we are now growing, releasing more (including another big OSS visual auto-AI release just this week), and overall moving to next phases. If you like webgl, JS, or sales engineering (same industries you'd see in graph DBs), we are hiring for multiple roles in visual graph AI, and I'd very much love to chat :)