Wikimedia

GCI Wiki Study / Create a query to get statistics about Wikipedia: Redundant deletion reasons used the week-end

As one of the many editors of the English Wikipedia, I'm asking myself different thoughts regarding wiki statistics and I made a formal study of it. You can help me by answering these questions!

Basic to middle SQL skills are needed to perform these tasks depending on the indicated difficulty.

The target is to have for each question an SQL request and its result in the format of a table. Finally, we need to present the data visually as a graph. See linked page below for detailed instructions.

Redundant deletion reasons used the week-end

When Wikipedia articles (and more generally pages) are deleted, a log entry is created with a reason provided by the administrator that has performed the action. Weekend usually means more vandalism on articles, including dummy page creations that need to be deleted.

Question: We want to get the most redundant reasons in these type of logs happened over the weekend.

Tip: select only the good log_type value in logging table. Join it with comment one. Be careful to limit logs lines to articles and to limit the number of results to avoid timeouts.

Difficulty: average

Task tags

  • statistics
  • gci wiki study
  • deletion
  • sql

Students who completed this task

mogmog123

Task type

  • code Code
  • assessment Outreach / Research
close

2018