For this blog, I decided to choose and explore the police expenditures dataset which details the financial data of police expenses from June 2011 to January 31, 2014. The data types are as follows: ID number, Fiscal year, Department name, Vendor name, Transaction date, Dollar amount, Authority, Business tax registration certificate, Government activity, Fund group name, Fund type, Fund name, Account name, Transaction ID, Expenditure type, Settlement/judgement, Fiscal month number, Fiscal year-month, Fiscal year-quarter, Calendar month number, Calendar month/year, Calendar month, Data source, Authority name, Authority link. Along with these columns, there are 226,210 rows of detailed expense transactions. The record in this data set is the total sum of all police expenditures which amounts to roughly 4.86 billion dollars during the aforementioned time period. Additionally, this dataset keeps record of each expense transaction made by the police department which makes it easier to determine the proper allocation of the money into/from appropriate funds.
As Wallack’s and Srinivasan’s definition of ontology suggests that it “merely implies a distinction between groups’ mental maps of their surroundings”, a dataset’s ontology is the transparency of links and boundaries which allows us to further understand a given dataset (Page 2). In other words, a dataset’s ontology is essentially ways in which a dataset’s connections can be recognized and traced. Similarly this dataset’s ontology allows for transparency and understanding of the funds being used by the police department. For instance, it can be reviewed to make sure that no illegal expense transaction occurred or for simple accounting purposes. This data can be helpful to those government agencies that have to estimate the amount of money that should be set aside for the police department from the budget. It creates accountability by both government and public. It also increases transparency for the tax payers who can track their tax dollars at work. It organizes which fund the transaction is to pull money from. It also keeps track of which officer is submitting the expense.
This dataset can be organized in many different ways to provide more information. It can give you a lot of information about where or which vendors the department spends most or least of its money to help understand where the resources are being pulled. The expenses range from cellphone bills to water bottles. You are able to prioritize whichever column you want allowing you freedom to organize the data in any order. Dataset can tell us exactly where the department is using its money and can be helpful in times of budgeting.
I think what got left out was more details. The dataset uses a lot of broad categories and at times uses the same type for many different transactions. For instance, the expenditure type “supplies and other services” is repeated for more than half of the items. It would help to create additional subcategories to keep the data clean and legible.
If I were starting over with data-collection and describe a completely different ontology, I would create more data types in order to organize the data even more. For instance, I would break up “supplies and other services” to additional data types which would further organize data and specify which commodities were purchased and see if I can’t put those into a separate category or data type. I would also add data types showing different communities or regions of LA and the money being spent there. Therefore, if we are policing in one area more than another resulting in overspending resources in one area, we ought to address the underlining issues of such a region and explore the true cause of turmoil there instead of simply over-policing and overspending.