Skip to content

burgeng/WikiView

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

25 Commits
 
 
 
 
 
 

Repository files navigation

WikiView

WikiView is a Python-based Wikipedia hyperlink scraper, paired with an interactive frontend for visualization and exploration of resulting network structures.

Usage:

Before beginning, ensure you have cloned this repository to your local machine.

  • Generate a corpus of craled Wikipedia articles from a desired starting (seed) article. For example, to start from the article for Baseball:
python ./crawl_wikipedia.py "https://en.wikipedia.org/wiki/Baseball" <max. depth> <max. links per page>

i.e.

python ./crawl_wikipedia.py "https://en.wikipedia.org/wiki/Baseball" 3 20
  • The corpus wiki_graph.gexf (Graph Exchange XML Format) will be generated in the current working directory, which can be uploaded to network visualization tools like Gephi.
image

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages