Gephi forums • panel-collected data (repeated source-target entries)

panel-collected data (repeated source-target entries)

Posted: 28 Aug 2017 20:41
by CIPM/SIPM
I am very new to network analysis, and I have not found an answer to the following within the Gephi documentation or forums. This is a network analysis question.

I am working with data for a network of 200+ organizations all communicating among themselves. The network is undirected.
Because the organizations cannot all be interviewed, a panel of 20 experts was asked which organizations communicate with which others, according to their knowledge. Not all experts knew all 200 organizations, but they all answered for all pairwise combinations of organizations they knew.

As a result, we have a dataset in which many of the source-target pairs are repeated. For example, (Organization ABC,Organization XYZ) may have been cited by 4 panelists, and is thus present four times in the dataset.
Also, because of the way the data was collected, 'mirror' pairs are also present. For example, another 3 panelists might have cited the (Organization XYZ,Organization ABC) pair, bringing that pair to 7 instances in the data, because communication as investigated here is undirected.

My options are:
1) Eliminate all repeats, mirrors included. In the example, keep only one instance of (Organization ABC,Organization XYZ), and delete the other 6. I can generate a count (=7) and store it in an additional variable.

2) Eliminate all the repeats for a given source-target combination, but not its mirror. In the example, keep only one each of (Organization ABC,Organization XYZ) and (Organization XYZ,Organization ABC). I can generate a count for each of those (=4, 3).

3) Keep the data exactly as is: four instances of (Organization ABC,Organization XYZ) and three instances of (Organization XYZ,Organization ABC).

Are there pros and cons to each approach, or is one of them correct, and the others not?
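
To make the three options concrete, here is a minimal Python sketch of what each would produce for the ABC/XYZ example. The `citations` list and the function names are hypothetical, made up for illustration; the real data would be read from the panel dataset rather than hard-coded.

```python
from collections import Counter

# Hypothetical sample data: one (source, target) tuple per panelist citation.
citations = (
    [("Organization ABC", "Organization XYZ")] * 4
    + [("Organization XYZ", "Organization ABC")] * 3
)

def collapse_undirected(pairs):
    """Option 1: one edge per unordered pair; the count covers mirrors too."""
    # Sorting each pair maps (XYZ, ABC) and (ABC, XYZ) to the same key.
    return Counter(tuple(sorted(pair)) for pair in pairs)

def collapse_ordered(pairs):
    """Option 2: one edge per ordered pair; mirrors are counted separately."""
    return Counter(pairs)

def keep_all(pairs):
    """Option 3: leave every citation in place as its own (parallel) edge."""
    return list(pairs)

undirected = collapse_undirected(citations)
ordered = collapse_ordered(citations)
parallel = keep_all(citations)

for (source, target), count in undirected.items():
    print(source, target, count)
```

With option 1 the count (7 here) could be stored as an edge weight, for instance in a Weight column next to Source and Target in a CSV edge list, assuming that column layout matches how the data is imported.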

Re: panel-collected data (repeated source-target entries)

Posted: 29 Aug 2017 20:19
by eduramiba
Hi,
Gephi supports parallel (repeated) edges with different attributes, and there should be no cons to that.
Please report any problems you find. I advise you to test with the latest 0.9.2 pre-release: https://github.com/gephi/gephi#nightly-builds

Re: panel-collected data (repeated source-target entries)

Posted: 30 Aug 2017 16:22
by CIPM/SIPM
Thank you. Will do.