For example, you might be doing a study of urban communities, but how exactly will you define these? You might use a definition based on population density, the percentage of the landscape covered by concrete, or the number of roads per square mile. Developing and documenting clear definitions for each variable and attribute to be included in your study is an essential step before going forward with data collection and analysis. Doing this in advance will help to ensure that everyone involved in the study understands the data to be collected and that each item will be mutually exclusive, thus avoiding confusion later in the process.
Public health example
The best way to determine the purpose of your research is to think about the central question that you want to have answered. For instance, maybe you are interested in examining whether people who live in a certain section of town (e.g., the poorer section) suffer from respiratory problems. The different parts of the town may be described as being “rich,” “middle class,” or “poor” based on the income levels of residents who live there. The question of health connected to income could be addressed using a GIS. Following the earlier mentioned guidelines for integrating GIS into the study, you would want to determine the study boundaries and relevant geographic features that are going to be part of the study. In this case, you are interested in drawing your study boundaries based on income. In other words, you want to draw a boundary around the low-income section of town, the mid-economic area, and the upscale part of town. Where would you begin? You could begin by looking at US Census data for that town to determine the clusters of different parts of town based on household income. You could then draw your boundaries based on the clustering observed in the US Census data. Step 1 in the process also calls for identifying other geographic features that might be important to your study. In this example, that would include determining the locations of different factories, incinerators, or other production facilities (figures 2.13 and 2.14).
Figure 2.13 Mapped locations of sites listed in the US Environmental Protection (EPA) Agency Toxic Release Inventory (TRI). Sites are displayed here on an interactive map of eastern Massachusetts. These maps help users visually explore data from the EPA’s TRI and Superfund Program. The underlying data are also available for download from the EPA website. Courtesy of the Division of Specialized Information Services of the US National Library of Medicine. Basemap data from Esri, DeLorme, USGS, NOAA, NGA, IFL.
Figure 2.14 Data underlying the map shown in figure 2.13. Data from EPA.
Step 2 in the integration of GIS into your study calls for developing a data dictionary. You as the researcher would determine what level of household income would fit your categories, for example, rich (yearly income over $45,000), middle class (yearly income between $20,000 and $44,999), and poor (yearly income under $19,999). In your GIS, you could also gather information on the level of emissions from these different facilities by looking at the Environmental Protection Agency’s website, where the EPA indicates industrial sites that emit beyond a certain specified level. This would be valuable information regarding air quality that could become a part of your GIS database.
You could interview people who live throughout the town and conduct a survey that inquires about their general health, income, and whether they suffer from any respiratory problems. As long as you know the geographic locations of respondents, you could enter this information, along with the survey answers, into a database. That way, when you conduct your analysis, you will be able to have geographic information for each unit of analysis (e.g., household). If you want to aggregate the data slightly to protect the privacy of the individuals, you could categorize respondents as living in a neighborhood versus at an actual street address. On further investigation, you may want to geographically locate various factories, incinerators, or other production-oriented facilities that could be emitting substances that affect people’s respiratory health.
Another important thing to consider in determining your research purpose is the general theme that is a part of your research question. In the foregoing example, some of the potential themes might be environmental health, social inequality, and/or environmental justice.
After determining the general purpose of your research, you can then ask the question, How would GIS be helpful to the project? In other words, how would using a GIS enhance the study? A GIS is useful because it facilitates a more holistic and contextual view of a research problem or issue. It accomplishes this through bringing together a variety of different data types. Any study that you choose to develop will most likely include a variety of important variables. The trick in using the GIS is to identify which variables will best be studied using a GIS. This topic is discussed in greater detail in the following chapter.
Moving forward
In this chapter, you learned about strengths and challenges to each letter in the GIS. Additionally, you learned about conceptualization and the right questions to consider as you begin to frame a spatially based research project. Furthermore, you learned what questions to ask yourself about data and analysis to help move you forward with your project. You also learned about the conceptual framework, a useful tool for guiding conceptualization. Chapter 3 discusses research design, which is fundamental to the research process.
Review questions
1. What are some difficulties with the G in GIS, as discussed in the chapter?
2. How do you determine project goals?
3. What is the difference between a variable and an attribute?
4. How do you know as a researcher when it is appropriate to employ data aggregation techniques?
5. What are three questions you can ask about data?
6. What are some questions you can ask about location as you design your research project?
7. What are four useful questions to ask about analysis?
8. What is the difference between a logical data model and a conceptual model?
Additional readings and references
Bernhardsen, T. 2002. Geographic Information Systems: An Introduction. 3rd ed. New York, NY: Wiley.
Bolsted, P. 2008. GIS Fundamentals: A First Text on Geographic Information Systems. 3rd ed. White Bear Lake, MS: Eider Press.
Ormsby, T., E. J. Napolean, R. Burke, and C. Groessl. 2010. Getting to Know ArcGIS Desktop. Redlands, CA: Esri Press.
O’Sullivan, D., and D. Unwin. 2010. Geographic Information Analysis. 2nd ed. New York, NY: Wiley.
Steinberg, S. L., and S. J. Steinberg. 2008. “People, Place, and Health: A Sociospatial Perspective of Agricultural Workers and Their Environment.” Humboldt State University. http://humboldt-dspace.calstate.edu/xmlui/handle/2148/428.
——— 2011. “Geospatial Analysis Technology and Social Science Research”. In Handbook of Emergent Technologies, ed. S. Hesse-Biber, 563–91. Oxford: Oxford University Press.