International Trade Network Data...
There are two principle sources of network data on international trade (that I have been able to find):
- Prof. Kristian Skrede Gleditsch at the University of Essex: Data are from 1948-2000 and the primary source is the IMF.
- UN Comtrade: Data are from 1962-2009 and the primary source is of course the UN.
- There is a third data source, the Economics Web Institute, that I would like to dissuade people from using even though they use Prof. Gleditsch's IMF data (without going into too much detail, the Economics Web Institute, seems to use an inconsistent methodology to assign edge weights (trade values) to countries when converting from Prof. Gleditsch's raw .asc files to .xls workbooks).
Hierarchical Clustering and Trade Network Density:
For the moment I am simply trying to replicate the work of the Deem et al. (2010) paper (linked to above). I have applied a average linkage hierarchical clustering algorithm to the OECD international trade network as outlined in their paper. Below are some plots of the cophenetic correlation coefficient (CCC) and network density for the international trade network for OECD countries using two different data-sets. The first plot uses data entirely from the UN Comtrade database. NBER recessions are marked with gray bars. I downloaded the data by hand from Comtrade (commodity code is SITC ver. 1 AG0) and then used a Python script to clean and reorganize the .xls spreadsheets into more manageable text files. Statistical analysis is done using SciPy, network analysis (so far) has been done using NetworkX, and plotting has been done using Matplotlib.
The plot below is the CCC and network density for the international trade network for OECD countries using Prof. Gleditsch's IMF data for 1948-2000 and then UN Comtrade from 2001-2009 (commodity code this time is SITC ver. 3 AG0).
There are some differences between the two plots of the CCC (I have not yet tested whether or not they are significantly different nor have I tested whether or not the CCC jumps significantly during/after recessions...this is on my to do list!).
Still to come:
- Dendrogram of identified clusters
- Results of community structure algorithm applications
- Weighted clustering algorithms and other graph measures
No comments:
Post a Comment