|
@@ -690,7 +690,7 @@
|
|
|
},
|
|
|
"cell_type": "markdown",
|
|
|
"source": [
|
|
|
- "The function is still not perfect, and very few non-English apps might get past our filter, but this seems good enough at this point in our analysis — we shouldn't spend a too much time on optimization at this point.\n",
|
|
|
+ "The function is still not perfect, and very few non-English apps might get past our filter, but this seems good enough at this point in our analysis — we shouldn't spend too much time on optimization at this point.\n",
|
|
|
"\n",
|
|
|
"Below, we use the `is_English()` function to filter out the non-English apps for both data sets:"
|
|
|
]
|
|
@@ -1342,7 +1342,7 @@
|
|
|
"source": [
|
|
|
"However, this niche seems to show some potential. One thing we could do is take another popular book and turn it into an app where we could add different features besides the raw version of the book. This might include daily quotes from the book, an audio version of the book, quizzes about the book, etc. On top of that, we could also embed a dictionary within the app, so users don't need to exit our app to look up words in an external app.\n",
|
|
|
"\n",
|
|
|
- "This idea seems to fit well with the fact that the App Store is dominated by for-fun apps. This suggests the market might be a bit saturated with for-fun apps, which means a practical, purposed app might have more of a chance to stand out among the huge number of apps on the App Store.\n",
|
|
|
+ "This idea seems to fit well with the fact that the App Store is dominated by for-fun apps. This suggests the market might be a bit saturated with for-fun apps, which means a practical app might have more of a chance to stand out among the huge number of apps on the App Store.\n",
|
|
|
"\n",
|
|
|
"Other genres that seem popular include weather, book, food and drink, or finance. The book genre seem to overlap a bit with the app idea we described above, but the other genres don't seem too interesting to us:\n",
|
|
|
"\n",
|
|
@@ -1410,7 +1410,7 @@
|
|
|
"source": [
|
|
|
"One problem with this data is that is not precise. For instance, we don't know whether an app with 100,000+ installs has 100,000 installs, 200,000, or 350,000. However, we don't need very precise data for our purposes — we only want to get an idea which app genres attract the most users, and we don't need perfect precision with respect to the number of users.\n",
|
|
|
"\n",
|
|
|
- "We're going to leave the numbers as they are, which means that we'll consider that an app with 100,000+ installs has 100,000 installs, and an app with 1,000,000 install has 1,000,000 installs, and so on.\n",
|
|
|
+ "We're going to leave the numbers as they are, which means that we'll consider that an app with 100,000+ installs has 100,000 installs, and an app with 1,000,000+ installs has 1,000,000 installs, and so on.\n",
|
|
|
"\n",
|
|
|
"To perform computations, however, we'll need to convert each install number to `float` — this means that we need to remove the commas and the plus characters, otherwise the conversion will fail and raise an error. We'll do this directly in the loop below, where we also compute the average number of installs for each genre (category)."
|
|
|
]
|