Score Data Explained

Error message

Warning: file_get_contents(http://m.sports.naver.com/baseball/record/index.nhn) [function.file-get-contents]: failed to open stream: HTTP request failed! HTTP/1.1 404 Not Found in include_once() (line 3 of /www/npaul.godohosting.com/sidestandingsfixed.php).

For a bit of background information, The data here is made publicly available by the KBO (in Korean). It is updated in real-time and the available data dates back to the 2008 season. The game data is automatically translated into English using a simple search and replace script that I developed.

The translation includes transliterations of player names for the full team rosters of the 2013 season. This turned out to be much more of a hassle than I had anticipated for a few reasons:

  • There is no standard, accepted, official method of transliterating Korean names. There are some basic accepted practices (김=Kim, 이=Lee, etc), but beyond that more or less anything goes. One Korean name might be written several different ways in English, and usually it comes down to the personal preference of the individual in question. (example: 김주영 may be written Kim Ju-Young, Kim Joo-Young, Kim Ju-Yeong, Kim Ju-Yong, etc...)
  • There is no good place to look for "official" information. Some of the players who have played internationally, in the Olympics, WBC, etc might have their names printed in English on a team roster somewhere, but the majority of the league does not.
  • Not wanting to translate all of the names by hand, I decided to use Wikipedia as a source. The vast majority of KBO players have a wikipedia page. However, this also caused problems. Some names were written differently on the Korean and English versions of the Wikipedia pages, and some of the translations turned out to be so unusual that I decided to manually override a few of them.
  • Further complicating matters is the fact that there are a few players in the league that share the exact same name. This causes problems linking to the stat pages. I still haven't found a perfect solution to this problem.

So the process was a bit of a pain, and if I had to do it all over again I would probably do it differently, but it's mostly done. The exceptions are that the player name database only includes names from 2013 rosters. If you choose to look at results of games in past seasons you will see some untranslated names. It would be possible to include rosters from past seasons, but I don't think it's worth the time or effort. The only other thing that may cause problems would be a mid-season roster change or addition. This would require a manual update to the player database. Once they are in the database everything is automatic. If you find any problems with player names, or have any suggestions for improvement, please contact me.

You will also see below the game score that sometimes additional Korean words appear.These are usually game notes, statistics, or factoids of some kind. For example, "player X got his 1,000th career hit in the 5th inning" or "most runs allowed by team X since August 5th, 2009", etc. These are sometimes included by the KBO with their score reports. Translating these automatically is much more problematic than the game data, team names, or player names because we cannot predict what these notes will say before they happen. So I was faced with the decision of either leaving them in Korean or removing them entirely. For the time being I have decided to leave them in Korean. I've started to translate these little by little, but it will never be 100% automatic like the names/teams are. I would appreciate any feedback you can give.

 

With regards to reading the data, please see below for details.

  • Starting Pitcher data explained: (innings pitched, batters faced, P(pitch count), H(hits given up), HR(homeruns given up), B(walks), K(strikeouts), total runs, earned runs)
  • Example: (5,20,P91,H4,HR1,B2,K3,2,2) : 5 innings pitched, 20 batters faced, 91 pitches thrown, 4 hits given up, 1 homerun given up, 2 walks, 3 strikeouts, 2 runs, 2 earned runs
  • Relief Pitcher data explained: the number shown is the inning in which the pitcher entered the game.
  • Homerun data explained: (inning, runs)season total
  • Example (4,2)7 : player hit 2-run homerun in 4th inning, his 7th homerun of the season.
  • Starting pitcher information for the following day's games is (usually) updated the night before.
  • There are (usually) no games on Mondays