Hi, here are some newbie questions, but first some background:
I'm trying to get my mind around a large data warehouse. I started out filtering SQL schemas to provide .gv input summarized by grep and sed and simplified as follows:
digraph dg {
tablename1 -> "foreignkey1.in.tablename1";
tablename1 -> "foreignkey2.in.tablename1";
tablename2 -> "foreignkey1.in.tablename2";
tablename2 -> "foreignkey2.in.tablename2";
// and so on...
}
The idea was to dump this in graphviz and let dot have at it. Well, dot did its best but the result was a large indigestible lump of nodes and arcs.
Then I stumbled upon Gephi and fed the .gv file to it. I got another indigestible lump, but this time I was able to apply the Yifan Hu Proportional layout to it and zoom in. The most highly-connected nodes stood out from the other ones and allowed me to visually identify the most important top-level tables and relationships in the system. Also, the most interrelated nodes tended to end up together, so closeness became a measure of affinity and helped me envision an eventual partition of the system into families of tables and foreign keys. It looks very much like a star chart, with the client galaxy connected to the service provider galaxy via the intermediate payment relationship galaxy. Tres cool, mes vieux.
But I want more!
I'd like to know how to increase node size depending on the simple number of connections (arcs) it has, and (more power! more! more!), I'd like to know how to set up the Data Table to display only the nodes selected in the Graph window. This would be majorly cool (as my granddaughter might say) when doing interactive graph exploration and selecting little groups of nodes here and there to see what conceptual affinity they have as reflected in the table names, in addition to sheer arc connectivity. I have visions of brown bag sessions using this powerful tool to mine the subject matter experts for database knowledge.
Summarizing,
1. How do I size node representations according to arc connections, and
2. how do I set up the Data Table to display only the nodes and arcs currently selected in the graph?
setting node sizes based on connections, and the data table
- eduramiba
- Gephi Code Manager
- Posts:1064
- Joined:22 Mar 2010 15:30
- Location:Madrid, Spain
Re: setting node sizes based on connections, and the data ta
Hi,
Cool that you enjoy Gephi
Let's see.
1. This is done with the Ranking window. I suggest you look at the whole tutorial at http://gephi.org/2010/quick-start-tutorial, especially slide 15, for what you want. You will have to select Degree (number of connections) as the parameter for sizing nodes.
2. This is not implemented yet (an idea for it exists), but you can work around it easily. Select the nodes that you want to see, right-click and choose Copy to - New workspace. That will create a subgraph with just these nodes in a new workspace.
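If you want to see what "size by Degree" amounts to outside of Gephi, here is a minimal Python sketch. It is not Gephi's API or its actual implementation: the function names `degrees` and `scale_sizes`, the regex, and the 10 to 50 size range are all assumptions made up for illustration. It counts the arcs touching each node in a simple .gv edge list and maps the counts linearly onto a size range.

```python
import re
from collections import Counter

# Matches simple `a -> b` edges, with or without double quotes around the IDs.
# This is an illustrative pattern, not a full DOT parser.
EDGE = re.compile(r'"?([^"\s;]+)"?\s*->\s*"?([^"\s;]+)"?')


def degrees(dot_text):
    """Count the arcs touching each node (in-degree plus out-degree)."""
    counts = Counter()
    for src, dst in EDGE.findall(dot_text):
        counts[src] += 1
        counts[dst] += 1
    return counts


def scale_sizes(counts, min_size=10.0, max_size=50.0):
    """Linearly map each node's degree onto [min_size, max_size]."""
    lo, hi = min(counts.values()), max(counts.values())
    span = (hi - lo) or 1  # avoid dividing by zero when all degrees are equal
    return {
        node: min_size + (deg - lo) / span * (max_size - min_size)
        for node, deg in counts.items()
    }


sample = """
digraph dg {
tablename1 -> "foreignkey1.in.tablename1";
tablename1 -> "foreignkey2.in.tablename1";
tablename2 -> "foreignkey1.in.tablename2";
}
"""

deg = degrees(sample)
sizes = scale_sizes(deg)
for node in sorted(sizes, key=sizes.get, reverse=True):
    print(node, deg[node], sizes[node])
```

In this sample, tablename1 has the most arcs and so gets the largest size, which is the same effect the Ranking window gives you interactively.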
Eduardo
-
- Posts:4
- Joined:04 Sep 2011 23:24
Re: setting node sizes based on connections, and the data ta
Thanks for the reply.
Re #2, the logical place in the user interface might be a checkbox at the top of the Data Table display window that, when checked, suppresses display of nonselected node rows.
By the way, what I am looking for is a way to leave the graph alone, but restrict the Data Table display to just selected rows; alternatively, being able to sort them all together at the top of the table would also work.
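To make the two behaviours concrete, here is a rough Python sketch of what I mean. It has nothing to do with Gephi's internals: the row layout (dicts with an "id" key) and the function names `selected_rows` and `selected_first` are hypothetical, just there to illustrate the filter-versus-sort idea.

```python
def selected_rows(rows, selected_ids):
    """Keep only the table rows whose node is selected in the graph."""
    return [row for row in rows if row["id"] in selected_ids]


def selected_first(rows, selected_ids):
    """Move selected rows to the top, keeping the original order otherwise.

    sorted() is stable and False sorts before True, so rows whose id is in
    selected_ids come first without reshuffling anything else.
    """
    return sorted(rows, key=lambda row: row["id"] not in selected_ids)


# Hypothetical table contents and graph selection.
rows = [
    {"id": "tablename1", "label": "clients"},
    {"id": "tablename2", "label": "payments"},
    {"id": "tablename3", "label": "providers"},
]
selection = {"tablename1", "tablename3"}

filtered = selected_rows(rows, selection)
reordered = selected_first(rows, selection)
```

Either way the graph itself is left untouched; only the table view changes.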