App Data Analysis by Python

5 minute read

App Data Analysis by Python

DESCRIPTION

This project’s aim is to find mobile app profiles that are profitable for the App Store and Google Play markets.

This report helps developers understand what kinds of apps are likely to attract more users and make data-driven decisions to the kind of apps they would build. Note: only free English apps are analysed.

INPUT

A data set containing data about approximately ten thousand Android apps from Google Play. You can find more details and download the data set from this kaggle link.

A data set containing data about approximately seven thousand iOS apps from the App Store. You can find more details and download the data set from this kaggle link.

DETAIL STEPS

I represent in 2 ways: the first one working with python dataframe and libraries, the second one using python lists and built-in functions.

Below are some main step, please click the Google Colab link to see full code and how it works.

  • Step 0: Understanding the data (Open file, Initial Data Checks by finding colunm, row count, duplicate records and missing values)
  • Step 1: Data Cleaning (Delete Wrong Data, Removing Duplicate and Inappropriate Records)
  • Step 2: Data Analysis (Find Most Common Apps by Genre, Most Popular Apps by Genre)
  • Step 3: Data Visualization

OUTPUT/FINDINGS

Most common genre

On App Store

The general impression is that App Store is dominated by apps that are designed for fun (mostly games - 58.14%, entertainment - 8%, photo and video - 5%, etc.), while apps with practical purposes (education - 3.6%, others < 3% each) are more rare.


Top 5 and bottom 5 most common genres on Apple Store.

On Google Play

While the App Store is dominated by apps designed for fun, Google Play shows a more balanced landscape of both practical (family - 18.3%) and for-fun apps (game - 10.8%).


Top 5 and bottom 5 most common genres on Google Play Store.

It seems that a good number of apps are designed for practical purposes (family, tools, business, productivity, etc.). However, if we investigate this further, we can see that mostly apps in the family category are made with entertainment purpose (57.7&) and education purpose (27.5%).


Most common categories of "Family" genre on Google Play Store.

So, we found that most common apps in both platforms are for entertainment.

Now we’d like to get an idea about the kind of apps that have most users.

On App Store


Top 10 most popular genres on Apple Store.

Most of the popular genres might seem more popular than they really are and be dominant by a few giants.

  • On average, navigation apps have the highest number of user reviews (more than 86090 user reviews), but this figure is heavily influenced by Waze (account for 67% total reviews) and Google Maps (30%), which have close to half a million user reviews together.
  • The same pattern applies to
    • Reference apps, where have the Bible (account for 73% total reviews) and Dictionary.com which skew up the average rating.
    • Social networking apps, where the average number is heavily influenced by a few giants like Facebook (39%), Pinterest (14%), Skype, etc.
    • Music apps, where a few big players like Pandora (30%), Spotify (23%), and Shazam heavily influence the average number.
    • Book apps, where also have big players like Kindle (45%), Audible (19%).
    • Food and drink — examples here include Starbucks (35%), Domino’s Pizza(30%), etc.
    • Travel - Google Earth (40%), Yelp (20%),…
    • Photo & video apps - have Instagram (47%) which actually social networking apps.

Other genres that seem popular include weather or finance:

  • Weather apps — people generally don’t spend too much time in-app, and the chances of making profit from in-app adds are low. Also, getting reliable live weather data may require us to connect our apps to non-free APIs.
  • Finance apps — these apps involve banking, paying bills, money transfer, etc.

The average number of ratings seem to be skewed by very few apps which have hundreds of thousands of user ratings, while the other apps may struggle to get past the 10,000 threshold.


Distribution of app users on Apple Store.

After removing apps that have more than 10,000 reviews, we can see some new genres like Business, Shopping, Health & fitness and Productivity. Beside Shopping and Productivity, other genres share the same dominant pattern as describe above.


Top 10 most popular genres on Apple Store (below 10 thousands reviews).

To temperarily conclude, protential genres on Apple Store are Finance, Shopping and Productivity apps.

On Google Play


Top 10 most popular genres on Google Play Store.

On Google Play Store, communication, social and video players has most installs but again, the main concern is that these app genres might seem more popular than they really are. Moreover, these niches seem to be dominated by a few giants who are hard to compete against.

  • On average, communication apps have the most installs: 71,452,385. This number is heavily skewed up by a few apps that have over one billion installs (WhatsApp, Facebook Messenger, Skype, Google Chrome, Gmail, and Hangouts), and a few others with over 100 and 500 million installs.
  • We see the same pattern for the social category, which is the runner-up with 45,176,396 installs. The market is dominated by apps like Facebook, Instagram, Google+, etc..
  • The pattern is repeated for
    • video players apps (where we have giants like Youtube, Google Play Movies & TV, or MX Player)
    • photography apps (Google Photos and other popular photo editors)
    • productivity apps (Microsoft Word, Dropbox, Google Calendar, Evernote, etc.).

The average number of ratings seem to be skewed by very few apps which have milions of user installs, while the other apps may struggle to get past the billion (1,000,000,000 or 10^9) threshold.


Distribution of app users on Google Play Store.

After removing apps that have more than one billion installs, most popular genres on Google Play Store are Entertainment, Education and Games.

  • The game or entertainment genre seems pretty popular, but previously we found out this part of the market seems a bit saturated (most common genre), so we’d like to come up with a different app recommendation if possible.


Top 10 most popular genres on Google Play Store (below 10 milions installs).

Conclusions

In this project, we analyzed data about the App Store and Google Play mobile apps with the goal of recommending an app profile that can be profitable for both markets.

We concluded that taking a popular book (perhaps a more recent book) and turning it into an app could be profitable for both the Google Play and the App Store markets. The markets are already full of libraries, so we need to add some special features besides the raw version of the book. This might include daily quotes from the book, an audio version of the book, quizzes on the book, a forum where people can discuss the book, etc.

Back to Top ↑