Re: panel-collected data (repeated source-target entries)

2017-08-30T16:22:52+01:00

Thank you. Will do.

Statistics:Posted by CIPM/SIPM — 30 Aug 2017 16:22

Re: panel-collected data (repeated source-target entries)

2017-08-29T20:19:19+01:00

Hi,
Gephi supports parallel edges (repeated) with different attributes, and there should be no cons to that.
Please report any problem you may find. I advice you to test it with latest 0.9.2 pre-release https://github.com/gephi/gephi#nightly-builds

Statistics:Posted by eduramiba — 29 Aug 2017 20:19

panel-collected data (repeated source-target entries)

2017-08-28T20:41:47+01:00

I am very new to network analysis, and I have not found an answer to the following within the Gephi documentation or forums. This is a network analysis question.

I am working with data for a network of 200+ organizations all communicating among themselves. The network is undirected.
Because the organizations cannot all be interviewed, a panel of 20 experts was asked which organizations communicate with which others, according to their knowledge. Not all experts knew all 200 organizations, but they all answered for all pairwise combinations of organizations they knew.

As a result, we have a dataset for which many of the source-target pairs are repeated. For example, (Organization ABC,Organization XYZ) may have been cited by 4 panelists, and is thus present four times in the dataset.
Also, because of the way the data was collected, 'mirror' pairs are also present. For example, another 3 panelists might have cited the (Organization XYZ,Organization ABC) pair, bringing that pair to 7 instances in the data, because communication as investigated here is undirected.

My options are
1) eliminate all repeats. In the example, keep only one instance of (Organization ABC,Organization XYZ), and delete the other 6. I can generate a count (=7) and store it into an additional variable.

2) eliminate all the repeats for a given source-target combination, but not its mirror. In the example, keep only one each of (Organization ABC,Organization XYZ) and (Organization XYZ,Organization ABC). I can generate a count for each of those (=4, 3)

3) keep the data exactly as is. Four instances of (Organization ABC,Organization XYZ), three instances of (Organization XYZ,Organization ABC).

Are there pros and cons to each approach, or is one of them correct, and the others not?

Statistics:Posted by CIPM/SIPM — 28 Aug 2017 20:41

Gephi forums

Re: panel-collected data (repeated source-target entries)

Re: panel-collected data (repeated source-target entries)

panel-collected data (repeated source-target entries)