
Update Mission193Solutions.ipynb

darinbradley 2 years ago
parent
commit
4a80cc387b
1 changed file with 30 additions and 30 deletions

+ 30 - 30
Mission193Solutions.ipynb

@@ -2617,15 +2617,15 @@
    "source": [
     "It looks like the game log has a record of over 170,000 games.  It looks like these games are chronologically ordered and occur between 1871 and 2016.\n",
     "\n",
-    "For each game we have:\n",
+    "For each game we have the following:\n",
     "\n",
-    "- general information on the game\n",
-    "- team level stats for each team\n",
-    "- a list of players from each team, numbered, with their defensive positions\n",
-    "- the umpires that officiated the game\n",
-    "- some 'awards', like winning and losing pitcher\n",
+    "- General information on the game\n",
+    "- Team level stats for each team\n",
+    "- A list of players from each team, numbered, with their defensive positions\n",
+    "- The umpires who officiated the game\n",
+    "- Some awards, like winning and losing pitcher\n",
     "\n",
-    "We have a `game_log_fields.txt` file that tell us that the player number corresponds with the order in which they batted.\n",
+    "We have a `game_log_fields.txt` file that tells us that the player number corresponds to the order in which they batted.\n",
     "\n",
     "It's worth noting that there is no natural primary key column for this table."
    ]
@@ -2742,9 +2742,9 @@
     "hidden": true
    },
    "source": [
-    "This seems to be a list of people with IDs.  The IDs look like they match up with those used in the game log.  There are debut dates, for players, managers, coaches and umpires.  We can see that some people might have been one or more of these roles.\n",
+    "This seems to be a list of people with IDs. The IDs look like they match up with those used in the game log. There are debut dates for players, managers, coaches, and umpires. We can see that some people might have played one or more of these roles.\n",
     "\n",
-    "It also looks like coaches and managers are two different things in baseball.  After some research, managers are what would be called a 'coach' or 'head coach' in other sports, and coaches are more specialized, like base coaches.  It also seems like coaches aren't recorded in the game log."
+    "It also looks like coaches and managers are two different things in baseball. After some research, it appears that managers are what we would call a *coach* or *head coach* in other sports, and coaches are more specialized, like base coaches. It also seems that coaches aren't recorded in the game log."
    ]
   },
   {
@@ -2885,7 +2885,7 @@
     "hidden": true
    },
    "source": [
-    "This seems to be a list of all baseball parks.  There are IDs which seem to match with the game log, as well as names, nicknames, city and league."
+    "This seems to be a list of all baseball parks.  There are IDs that seem to match with the game log, as well as names, nicknames, city, and league."
    ]
   },
   {
@@ -3006,7 +3006,7 @@
     "hidden": true
    },
    "source": [
-    "This seems to be a list of all teams, with team_ids which seem to match the game log. Interestingly, there is a `franch_id`, let's take a look at this:"
+    "This seems to be a list of all teams, with team_ids that seem to match the game log. Interestingly, there is a `franch_id`. Let's take a look at this:"
    ]
   },
   {
@@ -3042,7 +3042,7 @@
     "hidden": true
    },
    "source": [
-    "We might have `franch_id` occurring a few times for some teams, let's look at the first one in more detail."
+    "We might have `franch_id` occurring a few times for some teams. Let's look at the first one in more detail."
    ]
   },
   {
@@ -3142,7 +3142,7 @@
     "hidden": true
    },
    "source": [
-    "It appears that teams move between leagues and cities.  The team_id changes when this happens, `franch_id` (which is probably 'Franchise') helps us tie all of this together."
+    "It appears that teams move between leagues and cities.  The team_id changes when this happens, and `franch_id` (which is probably *Franchise*) helps us tie all of this together."
    ]
   },
   {
@@ -3153,7 +3153,7 @@
    "source": [
     "**Defensive Positions**\n",
     "\n",
-    "In the game log, each player has a defensive position listed, which seems to be a number between 1-10.  Doing some research around this, I found [this article](http://probaseballinsider.com/baseball-instruction/baseball-basics/baseball-basics-positions/) which gives us a list of names for each numbered position:\n",
+    "In the game log, each player has a defensive position listed, which seems to be a number between 1 and 10. Doing some research, we find [this article](http://probaseballinsider.com/baseball-instruction/baseball-basics/baseball-basics-positions/), which gives us a list of names for each numbered position:\n",
     "\n",
     "1. Pitcher\n",
     "2. Catcher\n",
@@ -3165,11 +3165,11 @@
     "8. Center Field\n",
     "9. Right Field\n",
     "\n",
-    "The 10th position isn't included, it may be a way of describing a designated hitter that does not field.  I can find a retrosheet page that indicates that position `0` is used for this, but we don't have any position 0 in our data.  I have chosen to make this an 'Unknown Position' so I'm not including data based on a hunch.\n",
+    "The 10th position isn't included. It may be a way of describing a designated hitter that does not field. We can find a retrosheet page that indicates that position `0` is used for this, but we don't have any position 0 in our data. We have chosen to make this an *Unknown Position*, so we're not including data based on a hunch.\n",
     "\n",
     "**Leagues**\n",
     "\n",
-    "Wikipedia tells us there are currently two leagues - the American (AL) and National (NL). Let's start by finding out what leagues are listed in the main game log:"
+    "Wikipedia tells us there are currently two leagues — the American (AL) and National (NL). Let's start by determining which leagues are listed in the main game log:"
    ]
   },
   {
@@ -3206,7 +3206,7 @@
     "hidden": true
    },
    "source": [
-    "It looks like most of our games fall into the two current leagues, but that there are four other leagues.  Let's write a quick function to get some info on the years of these leagues:"
+    "It looks like most of our games fall into the two current leagues, but there are four other leagues. Let's write a quick function to get some info on the years of these leagues:"
    ]
   },
   {
@@ -3247,7 +3247,7 @@
     "hidden": true
    },
    "source": [
-    "Now we have some years which will help us do some research.  After some googling we come up with:\n",
+    "Now we have some years, which will help us do some research. After some googling we come up with this list:\n",
     "\n",
     "- `NL`: National League\n",
     "- `AL`: American League\n",
@@ -3256,7 +3256,7 @@
     "- `PL`: [Players League](https://en.wikipedia.org/wiki/Players%27_League)\n",
     "- `UA`: [Union Association](https://en.wikipedia.org/wiki/Union_Association)\n",
     "\n",
-    "It also looks like we have about 1000 games where the home team doesn't have a value for league."
+    "It also looks like we have about 1,000 games where the home team doesn't have a value for league."
    ]
   },
   {
@@ -3506,13 +3506,13 @@
     "The following are opportunities for normalization of our data:\n",
     "\n",
     "- In `person_codes`, all the debut dates will be able to be reproduced using game log data.\n",
-    "- In `team_codes`, the start, end and sequence columns will be able to be reproduced using game log data.\n",
-    "- In `park_codes`, the start and end years will be able to be reproduced using game log data.  While technically the state is an attribute of the city, we might not want to have a an incomplete city/state table so we will leave this in.\n",
-    "- There are lots of places in `game` log where we have a player ID followed by the players name.  We will be able to remove this and use the name data in `person_codes`\n",
-    "- In `game_log`, all offensive and defensive stats are repeated for the home team and the visiting team.  We could break these out and have a table that lists each game twice, one for each team, and cut out this column repetition.\n",
-    "- Similarly, in `game_log`, we have a listing for 9 players on each team with their positions - we can remove these and have one table that tracks player appearances and their positions.\n",
-    "- We can do a similar thing with the umpires from `game_log`, instead of listing all four positions as columns, we can put the umpires either in their own table or make one table for players, umpires and managers.\n",
-    "- We have several awards in `game_log` like winning pitcher and losing pitcher.  We can either break these out into their own table, have a table for awards, or combine the awards in with general appearances like the players and umpires."
+    "- In `team_codes`, the start, end, and sequence columns will be able to be reproduced using game log data.\n",
+    "- In `park_codes`, the start and end years will be able to be reproduced using game log data. While technically the state is an attribute of the city, we might not want to have an incomplete city/state table, so we will leave this in.\n",
+    "- There are many places in `game_log` where we have a player ID followed by the player's name. We will be able to remove this and use the name data in `person_codes`.\n",
+    "- In `game_log`, all offensive and defensive stats are repeated for the home team and the visiting team. We could break these out and have a table that lists each game twice, one for each team, and cut out this column repetition.\n",
+    "- Similarly, in `game_log`, we have a listing for 9 players on each team with their positions. We can remove these and have one table that tracks player appearances and their positions.\n",
+    "- We can do a similar thing with the umpires from `game_log`. Instead of listing all four positions as columns, we can put the umpires either in their own table or make one table for players, umpires, and managers.\n",
+    "- We have several awards in `game_log`, like winning pitcher and losing pitcher. We can either break these out into their own table, have a table for awards, or combine the awards in with general appearances like the players and umpires."
    ]
   },
   {
@@ -4148,7 +4148,7 @@
    "cell_type": "markdown",
    "metadata": {},
    "source": [
-    "## Adding The Team and Game Tables"
+    "## Adding the Team and Game Tables"
    ]
   },
   {
@@ -5945,9 +5945,9 @@
  ],
  "metadata": {
   "kernelspec": {
-   "display_name": "dscontent",
+   "display_name": "Python 3",
    "language": "python",
-   "name": "dscontent"
+   "name": "python3"
   },
   "language_info": {
    "codemirror_mode": {
@@ -5959,7 +5959,7 @@
    "name": "python",
    "nbconvert_exporter": "python",
    "pygments_lexer": "ipython3",
-   "version": "3.4.4"
+   "version": "3.8.5"
   },
   "notify_time": "5"
  },