Explore chapters and articles related to this topic
Reading in data locally and from the web
Published in Tiffany Timbers, Trevor Campbell, Melissa Lee, Data Science, 2022
Tiffany Timbers, Trevor Campbell, Melissa Lee
Let's dig into that line above a bit more. You can see that that bit of code has an opening tag (words between < and >, like < span>) and a closing tag (the same with a slash, like < /span>). HTML source code generally stores its data between opening and closing tags like these. Tags are keywords that tell the web browser how to display or format the content. Above you can see that the information we want ($800) is stored between an opening and closing tag (< span> and < /span>). In the opening tag, you can also see a very useful “class” (a special word that is sometimes included with opening tags): class="result-price". Since we want R to programmatically sort through all of the source code for the website to find apartment prices, maybe we can look for all the tags with the "result-price" class, and grab the information between the opening and closing tag. Indeed, take a look at another line of the source snippet above:
Functional Architectures for Indexing and Keywording
Published in Denise Bedford, Knowledge Architectures, 2020
Keywords are most often defined simply as words or concepts of great significance. It is a good definition because it accurately describes keywords as concept-level terms. And, it highlights their role – they are chosen because we believe they best represent the ideas expressed in the asset. It is common practice for keywords to be unmanaged and free form. We expect them to be free form expressions of our mental concepts. Keywords are a closer representation of concept-level indexing than indexing with a controlled vocabulary for the simple reason that they reflect an individual’s implicit thoughts and their understanding of meaning. Consider the different sources of keywords. Authors assign keywords to their publications and presentations. Publishers and editors assign keywords to submitted manuscripts. Staff across organizations assign keywords to their documents. People assign keyword tags to their social posts, web publications, posted photographs, posted videos, audios, and words of art. Keywords are probably the primary access point for any kind of knowledge asset published to the Web today. The practice of informal and unmanaged keywording far outpaces either professionally managed term indexing or even machine-based semantic indexing.
Balancing the Fluency-Consistency Tradeoff in Collaborative Information Search with a Recommender Approach
Published in International Journal of Human–Computer Interaction, 2018
Paul Seitlinger, Tobias Ley, Dominik Kowald, Dieter Theiler, Ilire Hasani-Mavriqi, Sebastian Dennerlein, Elisabeth Lex, Dietrich Albert
Different research perspectives on social tagging (e.g., Fu, Kannampallil, Kang, & He, 2010; Lorince & Todd, 2016; Nelson et al., 2009; Pirolli & Kairam, 2012; Schweiger, Oeberst, & Cress, 2014) suggest that the use of tag-based annotations in digital curation could support such balancing. Tags are freely chosen keywords with which users describe resources on the Web and which may be visible to others. On the one hand, as social tags reveal other members’ thoughts, they trigger cognitive conflicts and inspire new ideas during individual experimentation (e.g., Schweiger et al., 2014). On the other hand, given sufficient consistency in applying certain tags for reoccurring topics, they support tag-based sharing of collected resources and facilitate an exploitation of own and others’ search results (e.g. Fu et al., 2010; Lorince & Todd, 2016; Nelson et al., 2009; Pirolli & Kairam, 2012).
Adapting Information Search Tools for use by Health Consumers: Challenges and Lessons for Software Designers
Published in International Journal of Human–Computer Interaction, 2018
Mario A. Hernández, Joseph Sharit, Peter Pirolli, Sara J. Czaja
The majority of consumers use general search engines to find health information on the Internet, which necessarily exposes them to a large number of highly variable website sources. Mr. Taggy is a search browser that can aid users by helping them select search terms that facilitate their ability to identify information relevant to the health problem of interest. This browser allows users to use feedback stemming from tags that other users have ascribed to various information topics, thereby promoting more effective exploration of the topic space. Tags are words and phrases established by other users, which describe both the content and context of a web page. Mr. Taggy also provides recommendations for related tags and search terms, which can aid users in learning the vocabulary for a particular domain and thus help with the refinement of later searches.
Smart city data architecture for energy prosumption in municipalities: concepts, requirements, and future directions
Published in International Journal of Green Energy, 2020
Furthermore, data layer stores processed energy data from metering devices, energy sensors, etc. in MongoDB database as suggested by Zabasta et al. (2018). MongoDB is opted as a good choice for storing JSON encoded data as it internally retains data in a well-organized binary JSON format (BSON) (Zabasta et al. 2018). This layer also stores Metadata, which refers to the tags or descriptors that classify a document, dataset, data model, or data source (Tcholtchev et al., 2017). The metadata information is automatically generated by data processing system based on a standard of DataCite Metadata Schema and serves as a link to isolated data and aids data fusion in smart cities (Liu et al. 2017).