One of the questions I get asked a lot is “You provide various statistics for Fedora, can you show which packages are installed the most?”
To head off a lot of future requests, the answer is no, no I can’t. We do not have any sort of popcorn database which shows what packages are popular. When a user requests the OS to install a package, there is no “Hey I am asking for Bob if I can install libfoobar” that gets sent to the Fedora servers. What yum, dnf, PackageKit, or Salt do is then request for the repo data, looks to see if there is a way to figure out what is wanted and then asks for any packages that it needs to get.
It is this data that I can sort of glean some sort of idea of most installed packages.. but I feel it is way past “Lies”, “Damn Lies”, and “Statistics” into regions like “Political Promises” or “Half Life 3 confirmed”. Looking over an entire month of requests, sorting the data, and ranking the requests, I find that a bunch of packages show up a lot while others fall off in a long tail. Things that make this data dirty are the fact that if 200 people ask for wordpress, 150 for mediawiki and 90 for nagios.. I will see various PHP trunk packages that all three want as a higher number. I can’t simply tell if the person wanted that PHP package by itself or wanted wordpress. [I could possibly try and work out a transaction of requested packages and figure out what nodes and leafs there might be.. but I found that the tools don’t always request from download.fedoraproject.org everything it is wanting because it possibly already ‘knows’ where something is.
In any case, here are the most requested packages to the download website for January.
- nagios-plugins-2 *lots of plugins show up here*
- munin *lots of munin packages here
- nagios-plugins-2 *lots of other nagios removed*
- nodejs-0 *lots of other nodejs removed*
- GeoIP-1 *other GeoIP removed*
- R-core-3 *lots of other R packages removed*
- globus-gssapi-gsi-devel-12 *lots of other globus removed*
- xrootd-client-libs-4 *lots of other xrootd removed*
[Edited: I forgot this part]
This list of agents which get used to pull down packages for EPEL and Fedora was rather interesting. I combined all the yum together as the many different versions kind of polluted the numbers but here are the top agents:
- Debian Apt-Cacher-NG
- Axel 2.4 (Linux)
Source From: fedoraplanet.org.
Original article title: Stephen Smoogen: Trying to get an idea about what packages are used.
This full article can be read at: Stephen Smoogen: Trying to get an idea about what packages are used.