workshop under development. The workshop, scheduled for October 2013, will be aimed at graduate student researchers but open to all. The authors have mapped the core competencies for data information literacy outlined in "Determining Data Information Literacy Needs: A Study of Students and Research Faculty" (2011) with the workshop learning outcomes. Workshop planning considerations, content, and problems faced are described below. While this poster describes a workshop being developed at Indiana University, the authors hope this model will prove informative for other institutions looking to create similar workshops pairing data management and visualization. Planning Process •Tools chosen are freely available; two are open source •Team met weekly to discuss learning outcomes, data sources, data analysis techniques, and assessment strategies Some ideas were discarded over the course of planning: •Digital humanities data in HathiTrust (HT) and HT-related software were not publicly available for use, so the authors replaced HT data analysis with that of open humanities data provided by the IU Digital Collection Services. We plan to use the tool Voyant to visualize this data. •“Frameworks for a Data Management Curriculum” (Lamar Soutter Library et al, 2012) course plans were originally intended to be adapted for this workshop. They proved to be too technical and detailed for incorporation into a beginner-level class. Considerations Designing our tutorials •Emphasize analysis over datasets themselves •Future workshops responsive to feedback Finding appropriate data sources •Diversity of available datasets •Chose a wide breadth of examples to fit our expected audience •Criteria for selection: usable format, properly documented, serendipity Assessing the learning outcomes •Difficult in one-off workshops •Self-reporting of understanding less than perfect •Project-based assessment would be ideal References Carlson, Jake R.; Fosmire, Michael; Miller, Chris; and Sapp Nelson, Megan, "Determining Data Information Literacy Needs: A Study of Students and Research Faculty" (2011). Libraries Faculty and Staff Scholarship and Research. Paper 23. http://docs.lib.purdue.edu/lib_fsdocs/23 Lamar Soutter Library, University of Massachusetts Medical School and the George C. Gordon Library, and Worcester Polytechnic Institute. (2012). “Frameworks for a Data Management Curriculum: Course plans for data management instruction to undergraduate and graduate students in science, health sciences, and engineering programs.” Retrieved from http://library.umassmed.edu/data_management_frameworks.pdf Integrating Data Management Literacies with Data Visualization Instruction A One-Shot Workshop DIL Core Competencies mapped to Learning Outcomes DIL Core Competencies Learning Outcome Skills Required Assessment Measure Data Management and Organization Introduction to Databases and Data Formats Data Conversion and Interoperability Understand data management and organization concepts. Identify data types, define data and its lifecycle, and describe how data is used and reused in research. List reasons why data management and organization is important to researchers. Related questions on incoming and outgoing questionnaires. Data Visualization Quality Assurance Design effective data visualizations. Apply Illinsky’s four principles of data visualization. Use Google Refine to clean data in order to assure data quality. Use Sci2 to perform various types of analysis on data sets. Use Gephi to visualize the data. Critique visualizations according to rubric. Introduction to Databases and Data Formats Discovery and Acquisition of Data Data Analysis Find and select appropriate data—and apply proper analysis—to create a visualization that answers a particular research question. Identify research question. Choose appropriate types and formats of data for topical, network, burst, and temporal analysis. Navigate to data sources. Download data in proper format. Analyze data. Successful completion of questionnaire addressing skills needed to answer research question. Cultures of Practice Introduction to Databases and Data Formats Understand how cultures of practice influence the way data may be collected, described, or formatted. Based on intended audience for visualization and source of data, identify culture of practice/discipline. Be able to access information on discipline’s data collection standards [literature search], relevant metadata schema and controlled vocabularies [libraries website and DCC guide], and what tools and formats are common to particular disciplines [literature search]. Successful literature searches (and reading) for data collection and analysis methodologies. Successful access of disciplinary metadata schema. Metadata Data Preservation Save data to IU-supported research storage for both short- and long-term storage. Identify best storage options for short- and long-term data storage. Successfully navigate to appropriate storage solutions, log in. Transfer data from local drives to cloud storage accounts. Questionnaire. Ethics (including citation of data) Data Preservation Handle data and data visualizations ethically. Cite data from other sources in visualizations and documentation. Decide which data to make available, based on sensitivity. Store data on appropriate technologies using safeguards, based on sensitivity. Create visualizations that accurately represent the source dataset (i.e. does not manipulate or skew results). Questionnaire. [In a more in-depth workshop, one idea would be to have students create projects which are then reviewed and scored against a rubric after class ends.] Data Curation and Re-use Ethics (including citation of data) Data Management and Organization Prepare data and visualizations for re-use. Create documentation for others to reference when reusing data that describes methodology for finding, analyzing, and visualizing data. Cite data source and visualization creator in visualization caption(s). Organize and format data appropriately for future processing by tools. Questionnaire. [In a more in-depth workshop, one idea would be to have students create projects which are then reviewed and scored against a rubric after class ends.] Stacy Konkiel, Brianna Marshall & David Edward Polley | Indiana University-Bloomington • Topical • Temporal • Spatial • Network Workshop Specifics • ISI / Web of Knowledge • Open Congress • ICPSR • Open humanities data from IU Tools Data Sources Analyses Contact
[email protected] // @skonkiel
[email protected] // @nososternlib Download poster and handouts http://hdl.handle.net/2022/16814 This work is licensed under a Creative Commons Attribution 3.0 United States License. https://scholarworks.iu.edu/dspace/handle/2022/16814 !