-
Notifications
You must be signed in to change notification settings - Fork 550
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Local sparsification #1027
Comments
Structure-Preserving Sparsification Methods for Social Networks is about methods that work on unweighted graphs. They do reference Extracting the multiscale backbone of complex weighted networks approvingly as an approach for weighted networks. |
in Unsupervised Sparsification of Similarity Graphs, Gollub and Stein make the following claim (or something close to it): Let Which should imply that, where The authors then make an unexpected claim that this is equivalent to i.e. that the similarity between two objects that belong to the same class should be larger than the similarity between either of those objects and the graph centroid. This seems clearly untrue. Imagine a graph with only two edges. If the last proposed rule holds, the two nodes would never be put in the same class. The more defensible rule, that has some surprising consequences. It means that the the threshold for two nodes being put in the same class should be higher for nodes that are near the center of the graph and lower for nodes that are at the periphery. |
Improving Short Text Clustering by Similarity Matrix Sparsification basically picks up at but adds standard deviation. Let Then, Rakib and his co-authors propose the rule that This is basically a z-score based approach, and seems quite interesting. But, beyond setting Also, the shift from maximum to minimum seems hard to justify. |
ultimately, we are going to be thresholding clusters, so let's think about how we can use that fact for sparsification. let our Pairwiseif two nodes, Trianglethree nodes, So, then what is the minimum value that a particular edge can have? Assume all other new edges have a value of 0, then This does not look very promising as a form of edge sparsification, but i wonder if it could be valuable as a form of node sparsification. If every edge incident on a node |
Extracting the multiscale backbone of complex weighted networks is very local. Dyads that were very low scoring would not be removed. Not quite what we want. |
just did a quick and dirty check on csv example, and filtering out nodes like this this removed a bit over 10% of edges |
Another thing we could sparsify If an edge is the only path between two nodes, and the edge weight is greater then graph TD;
A-->C;
B-->C;
C-->D;
D-->E;
D-->F;
Don't know if there's an efficient way to identify such single path. |
Actually, if the edges between two parts of a graph are such that their weights are greater than the threshold, we should be able to cut those edges. graph TD;
A-->C;
B-->C;
B-->F;
C-->D;
D-->E;
D-->F;
if |
keep going down this road, and we are basically re-implementing clustering... |
We are contemplating exposing an argument that should sparsify the entire pair-score edgelist (#1026).
It would be very nice if we had some principled way to do this automatically for users. @fjsj and I discussed some "global" approaches previously (#834 (review)), but they didn't look to good.
Flavio also provided some links to "local" sparsification.
it would be great to see if one of these would be appropriate.
The text was updated successfully, but these errors were encountered: