LIS2600: Course Blog: 2012

Sunday, April 22, 2012

Week 15: Reading Notes

1) Galen Gruman. “What cloud computing really means” InfoWorld, April 2008. http://www.infoworld.com/article/08/04/07/15FE-cloud-computing-reality_1.html

"Cloud computing comes into focus only when you think about what IT always needs: a way to increase capacity or add capabilities on the fly without investing in new infrastructure, training new personnel, or licensing new software. Cloud computing encompasses any subscription-based or pay-per-use service that, in real time over the Internet, extends IT's existing capabilities."

2) Explaining Cloud Computing http://www.youtube.com/watch?v=hplXnFUlPmg&NR=1

-Google Docs is an example of cloud computing.

-"Cloud computing may bring an a return to centralized computing."

3) Thomas Frey. The Future of Libraries: Beginning the Great Transformation http://www.davinciinstitute.com/page.php?ID=120

"People who in the past visited libraries to find specific pieces of information are now able to find that information online. The vast majority of people with specific information needs no longer visit libraries. However, others who read for pleasure as example, still regularly patronize their local library."

-I disagree with this. Not only do people who read for pleasure visit their local library, but people who wish to find specific books (academic and otherwise), attend special events, learn a new language, get job and career advice--among other programs--also frequent their local library. Not everything can be placed online.

"A culture-based library is one that taps into the spirit of the community, assessing priorities and providing resources to support the things deemed most important. Modern day cultural centers include museums, theaters, parks, and educational institutions. The library of the future could include all of these, but individual communities will be charged with developing an overall strategy that reflects the identity and personality of its own constituency."

-I do agree with libraries having the potential, or the necessity, to become cultural centers, as many libraries already are. Large, urban public libraries already have the ability to be central to their communities, while smaller public libraries are often the only public place with the ability to celebrate an area's local heritage--without other cultural institutions nearby, such as a museum.

Monday, April 16, 2012

Week 13: Lab

http://liswiki.org/wiki/User:Hmg24

Sunday, April 15, 2012

Week 14: Reading Notes

1) No place to hide site: http://www.noplacetohide.net/

After reading the final chapter to "No Place to Hide," noting the various types of data mining as an individual walks through New York City, I am intrigued with the concept of information being collected everywhere. A person never thinks about it much, living in such a digital society, but the capabilities are there. "It takes less and less effort each year to know what each of us is about" (O'Hara). A very telling thought, indeed.

2) TIA and data mining http://www.epic.org/privacy/profiling/tia/

-Electronic Privacy Information Center
-Total "Terrorism" Information Awareness (TIA)
-TIA was developed as a tracking system, a means to track information about private data without warrants.
"In September 2003, Congress eliminated funding for the controversial project and closed the Pentagon's Information Awareness Office, which had developed TIA. This does not, however, necessarily signal the end of other government data-mining initiatives that are similar to TIA."

3) MyTurn: Protecting privacy rights in libraries, By Judah Hamer • September 24, 2008 http://greatlibrarynews.blogspot.com/2008/09/myturn-protecting-privacy-rights-in.html

This article applauds Vermont's law on stricter patron privacy policies and its necessity in libraries:
"In a library, the right to privacy means the right to open inquiry. Individuals must be able to seek information about any subject without fear of judgment, criticism or scrutiny of others. Freedom of speech is meaningless without the freedom to receive information; they are the underpinnings of a healthy democracy."

Sunday, April 8, 2012

Week 13: Reading Notes

1) John Blossom (2009). What makes social media tick: seven secrets of social media. Content Nation, chapter 2. Wiley Publishing Inc. Blossom_Content_Nation_7_Secrets_Social_Media.pdf

-user-generated content
-social v. collaborative publishing
-social media and branding/marketing

2) Charles Allan, “Using a wiki to manage a library instruction program: Sharing knowledge to better serve patrons, C&RL News, April 2007 Vol. 68, No. 4 Using a wiki to manage a library instruction program.htm

-"Creating your own wiki is a fairly straightforward process. Commercial sites abound on the Internet that allow anyone to create a free wiki. Some of the best known sites are seedwiki (seedwiki.com), pbwiki (pbwiki org), jotspot (http://www.jot.com/), twiki.org (twiki.org), and Phpwiki (phpwiki.org). Like free but basic e-mail service, free wikis are lacking in bells and whistles but seem to suffice for most users' needs. The option to upgrade (for a fee) is always available. Find the company with the most user-friendly interface and create an account."
-"centralized resource collaboration tool"
-Wikis are in general very similiar to the popularity of Google Documents...shared, collaborative workspaces which allow multiple users to contribute simutaneously and see updates when logging back in. The wiki system software might be different, but the idea of how online collaboration continues to be explored is an interesting concept.

3) Xan Arch, “Creating the academic library folksonomy: Put social tagging to work at your institution” C&RL News, February 2007 Vol. 68, No. 2 http://www.mendeley.com/research/creating-academic-library-folksonomy/?mrr_wp=0.1

-social tagging = saving bookmarks online
-"A few libraries are trying out social tagging: the University of Pennsylvania (UP) was one of the first library adopters with its PennTags (tags.library.upenn.edu/). The site allows UP students, faculty, and staff to bookmark quality Web sites and records in UP's online catalog and share these resources with others. Additionally UP users can create and share "projects" or groups of links on a single site named for the topic."

4) Jimmy Wales: “How a ragtag band created Wikipedia” http://www.ted.com/index.php/talks/jimmy_wales_on_the_birth_of_wikipedia.html

-goal was to give a free encyclopedia to everyone on the planet
-how they manage quality control: neutral point-of-view policy
-wiki model is how they work, not "web anarachists"..."passion of the community is for the quality of the work, not the process used to generate it"
-wiki books projects..."freely licensed textbooks are the next big thing in education"

Monday, April 2, 2012

Week 11 Lab

Google Scholar query: "digital library 2008 ... 2012"

Web of Science query: "digital library" narrow terms to 2008 - 2012

Google Scholar query: "virtual reference 2008 ... 2012"

Web of Science query: "virtual reference" narrow terms to 2008 - 2012

Sunday, April 1, 2012

Week 12: Reading Notes

1) David Hawking , Web Search Engines: Part 1 and Part 2 IEEE Computer, June 2006. http://www.computer.org/portal/web/csdl/doi/10.1109/MC.2006.213 and http://www.computer.org/portal/web/csdl/doi/10.1109/MC.2006.286

Part 1
GYM search engines - Google, Yahoo!, and Microsoft
"Currently, the amount of Web data that search engines crawl and index is on the order of 400 terabytes, placing heavy loads on server and network infrastructure. Allowing for overheads, a full crawl would saturate a 10-Gbps network link for more than 10 days" (86)
"Engineering a Web-scale crawler is not for the unskilled or fainthearted. Crawlers are highly complex parallel systems, communicating with millions of different Web servers, among which
can be found every conceivable failure mode, all manner of deliberate and accidental crawler traps, and every variety of noncompliance with published standards." (88)

Part 2
Indexers can create a file in two phases: scanning and inversion
"Compression. Indexers can reduce demands on disk space and memory by using compression algorithms for key data structures. Compressed data structures mean fewer disk accesses and can lead to faster indexing and faster query processing, despite the CPU cost of compression and decompression." (88)
"The major problem with the simplequery processor is that it returns poor results. In response to the query “the Onion” (seeking the satirical newspaper site), pages about soup and gardening would almost certainly swamp the desired result." (90)
"A high priority for search engine operation is monitoring the search quality to ensure that it does not decrease when a new index is installed or when the search algorithm is modiﬁed." (90)

2) Shreeves, S. L., Habing, T. O., Hagedorn, K., & Young, J. A. (2005). Current developments and future trends for the OAI protocol for metadata harvesting. Library Trends, 53(4), 576-589.

OAI-PMH = Open Access Initiative Protocol for Metadata Harvesting (2001)
data providers or repositories = make metadata available
service providers or harvesters = selectively harvest metadata
Open Language Archives Community = creating a worldwide language resource
Extendible Repository Resource Locators = ERRoLs (allows an OAI repository to stand alone as a web application)
"Controlled vocabularies will become more important..." (587)

3) MICHAEL K. BERGMAN, “The Deep Web: Surfacing Hidden Value” e p http://quod.lib.umich.edu/cgi/t/text/text-idx?c=jep;view=text;rgn=main;idno=3336451.0007.104

Standard search engines cannot find websites in the "deep web"
"Public information on the deep Web is currently 400 to 550 times larger than the commonly defined World Wide Web."
"Legitimate criticism has been leveled against search engines for these indiscriminate crawls, mostly because they provide too many results"
"Serious information seekers can no longer avoid the importance or quality of deep Web information. But deep Web information is only a component of total information available. Searching must evolve to encompass the complete Web."

Sunday, March 25, 2012

Week 11: Reading Notes

1) Mischo, W. (July/August 2005). Digital Libraries: challenges and influential work. D-Lib Magazine. 11(7/8). http://www.dlib.org/dlib/july05/mischo/07mischo.html

Abstracting and Indexing services (A&I)
Digital Library research funded by National Science Foundation 1993-1994
DLI-1 (Digital Libraries Initiative, first project, 1994 - 1998)
DLI-2 (1998)

2) A. et al. (July/August 2005). Dewey meets Turing: librarians, computer scientists and the digital libraries initiative. D-Lib Magazine. 11(7/8). http://www.dlib.org/dlib/july05/paepcke/07paepcke.html

"In 1994 the National Science Foundation launched its Digital Libraries Initiative (DLI). The choice of combining the word digital with library immediately defined three interested parties: librarians, computer scientists, and publishers. The eventual impact of the Initiative reached far beyond these three groups. The Google search engine emerged from the funded work and has changed working styles for virtually all professions and private activities that involve a computer."

"Aside from the monetary issues, librarians who involved themselves in the Initiative understood that information technologies were indeed important to ensure libraries' continued impact on scholarly work. Obvious opportunities lay in novel search capabilities, holdings management, and instant access. Online Public Access Catalogs (OPACS) constituted the entirety of digital facilities for many libraries. The partnership with computer science would contribute the expertise that was not yet widely available in the library community."

"core function of librarianship remains...information must be organized, collated, and presented."

hubs = web sites which direct visitors to other website that specialize on the hub's focus (sub topics are referred to as "authorities")

3) Lynch, Clifford A. "Institutional Repositories: Essential Infrastructure for Scholarship in the Digital Age" ARL, no. 226 (February 2003): 1-7. http://www.arl.org/bm~doc/br226ir.pdf

Institutional repositories = where universities may place material that research libraries deem worthy of collecting, also encourages "exploration and adoption of new forms of scholarly communication that exploit the digital medium in fundamental ways"

"It's vital that institutions recognize institutional repositories as a serious and long-lasting commitment to the campus community (and to the scholarly world, and the public at large) that should not be made lightly. In
establishing institutional repositories, institutions are both accepting risks and making promises; they are creating new expectations. In a budget crunch, the institutional repository may be one of the last things that can be cut, given the way that digital preservation demands steady and consistent attention and hence funding."

Monday, March 19, 2012

Week 9: Lab

Link for Week 9 lab: http://www.pitt.edu/~hmg24/lab9.html

Sunday, March 18, 2012

Week 10: Reading Notes

1) Tidwell, D. "Introduction to XML" http://www.ibm.com/developerworks/xml/tutorials/xmlintro/?S_TACT=104AHW06 developerWorks http://www.ibm.com/developerworks/xml/tutorials/xmlintro/xmlintro-pdf.pdf

XML = Extensible Markup Language, used to create your own tags

3 kinds of XML documents:
-Invalid documents don't follow syntax rules
-Valid documents follow both XML syntax rules and rules for their DTD or schema
-Well-formed documents follow the XML syntax rules but don't have either a DTD or schema

Comments = can appear anywhere in document, even before or after root element. Commets begin with .

Document Type Defition (DTD) defines elemnts which can appear in an XML document, the order they appear, and other basic details. Similar to SGML DTDs.

XML Schemas: have more power to define what valid XML documents look like

Document Object Model (DOM): defines set of interfaces to parsed version of XML document. Parser reads document and builds memory tree, so DOM can manipulate tree. Able to see what original doc contained, delete sections of the tree, rearrange the tree, add new branches, etc

2) Uche Ogbuji. A survey of XML standards: Part 1. January 2004. http://www-128.ibm.com/developerworks/xml/library/x-stand1.html

"Namespaces in XML 1.0 [W3C Recommendation] provides a mechanism for universal naming of elements and attributes in XML documents."

"XML Base [W3C Recommendation] provides a means of associating XML elements with URIs in order to more precisely specify how relative URIs are resolved in relevant XML processing actions."

"XML Inclusions (XInclude) 1.0 [in development] provides a system for merging XML documents. XInclude is generally used when you wish to split XML documents into manageable chunks."

"XML Information Set [W3C Recommendation], also known as the XML Infoset, defines an abstract way of describing an XML document as a series of objects, called information items, with specialized properties."

3) XML Schema Tutorial http://www.w3schools.com/Schema/default.asp

"An XML Schema:

defines elements that can appear in a document
defines attributes that can appear in a document
defines which elements are child elements
defines the order of child elements
defines the number of child elements
defines whether an element is empty or can include text
defines data types for elements and attributes
defines default and fixed values for elements and attributes"

"We think that very soon XML Schemas will be used in most Web applications as a replacement for DTDs. Here are some reasons:

XML Schemas are extensible to future additions
XML Schemas are richer and more powerful than DTDs
XML Schemas are written in XML
XML Schemas support data types
XML Schemas support namespaces"

XML Schemas: support data types, XML syntax, secure data information, etc.

Simple element: contains only text

<xs:element name="xxx" type="yyy"/>
where xxx is the name of the element and yyy is the data type of the element.

"Simple elements cannot have attributes. If an element has attributes, it is considered to be of a complex type. But the attribute itself is always declared as a simple type."

<xs:attribute name="xxx" type="yyy"/>
where xxx is the name of the attribute and yyy specifies the data type of the attribute.

Complex element: contains other elements or attributes

A complex XML element, "employee", which contains only other elements:

<employee>
<firstname>John</firstname>
<lastname>Smith</lastname>
</employee>

A complex XML element, "food", which contains only text:

<food type="dessert">Ice cream</food>

A complex XML element, "description", which contains both elements and text:

<description>
It happened on <date lang="norwegian">03.03.99</date> ....
</description>

Monday, March 12, 2012

Week 8: Lab

http://www.pitt.edu/~hmg24/index.html

Sunday, March 11, 2012

Week 9: Reading Notes

1) W3schools HTML5 Tutorial: http://www.w3schools.com/html5/default.asp

-HTML5 will be new HTML

-video, audio, and canvas features

-support for local storage: before, everything had cookies, but web storage is more secure and faster

-new forms

-Geolocation could provide geographical location of user if user allows it

-cache application: ability for offline browsing, loading speed, reduced server load

2) HTML5 Introduction: http://en.wikipedia.org/wiki/HTML5

-HTML5 is a mix of HTML and XHTML.

-Added video, audio, and canvas elements

-New types of forms: date/time, email, URL, search, etc.

3) W3schools XHTML Tutorial: http://www.w3schools.com/html/html_xhtml.asp

-cleaner version of HTML

-eXtensible HyperText Markup Language

Important consideration: "Today's market consists of different browser technologies. Some browsers run on computers, and some browsers run on mobile phones or other small devices. Smaller devices often lack the resources or power to interpret a "bad" markup language."

XHTML document contains three main sections:

1. DOCTYPE declaration

2. <head>

3. <body>

Sunday, February 26, 2012

Week 7 Lab

http://www.pitt.edu/~hmg24/index.html

Week 8: Reading Notes

1) W3 School Cascading Style Sheet Tutorial: http://www.w3schools.com/css/

CSS = Cascading Style Sheets

CSS was created to solve the problem of formatting tags, like font tags, to create style on an HTML page.

Styles are saved in external .css files, allowing the web developer to change the appearance or all the pages in a web site by editing only one file.

CSS declarations end with a semicolon and are grouped with curly brackets.

{color:green;

text-align:center;}

Comments are used to explain codes. They begin with /* and end with */.

External Style Sheets are ideal for many pages.

Internal Style Sheets are ideal for single documents.

2) CSS tutorial: starting with HTML + CSS http://www.w3.org/Style/Examples/011/firstcss

While the first tutorial is extremely in-depth, from basic to advanced CSS coding, I enjoyed this CSS tutorial more. Its step-by-step process involves the user more in actual coding rather than standard explanations with multiple chapter pages.

3) chapter 2 of the book Cascading Style Sheets, designing for the Web, by Håkon Wium Lie and Bert Bos (2nd edition, 1999, Addison Wesley, ISBN 0-201-59625-3)http://www.w3.org/Style/LieBos2e/enter/

Rule - a statement about one stylistic aspect of one or more elements

Style sheet - a set of one or more rules that apply to an HTML document

A rule contains two parts: selector and declaration. Selector is before the curly brackets, the declaration is within the brackets.

The selector specifies which part of the HTML document will be affected by the declaration, while the declaration is the rule that includes the effect, such as color.

h1 {color: red}

Fonts:

h1 {color: red;
font: 36pt serif}

or

h1 {font-style: italic;
font-weight: bold}

Tuesday, February 21, 2012

Week 6 Lab

Barcode: 2012-1234193

Sunday, February 19, 2012

Week 7: Reading Notes

1) W3schools HTML Tutorial: http://www.w3schools.com/HTML/

I already know basic HTML, but this tutorial was a nice review of paragraphs, headings, start tags and end tags. As the tutorial became more in-depth in basic formats, I appreciated the refresher course on more difficult codes like frames, which I would always need to look up in the past.

<html>

</frameset>

</html>

The advanced, media, XHTML (or XML), and reference sections are great resources for future reference!

2) HTML Cheatsheet http://www.wired.com/images/multimedia/webmonkeycheatsheet_full.pdf

This cheatsheet is also a good reference, but I would prefer looking up codes in the previous tutorial, as you could easily see how they work (and quickly test them) all within one or two pages.

3) Goans, D., Leach, G., & Vogel, T. M. (2006). Beyond HTML: Developing and re-imagining library web guides in a content management system. Library Hi Tech, 24(1), 29-53.

Options:

-FrontPage

-Open source

-In-house

I understand databases are used to keep track of updating library web guides, but how they connect to formatting is still a difficult concept for me. Whether one uses the content management system itself, or code in FrontPage or another format, is up to the librarian/library. I would probably better understand how the transition occurs in a real-time process, or shadowing someone like a web development librarian, a title referenced in the article.

Monday, February 13, 2012

Week 5 Lab

Task 1:

Task 2:

http://www.pitt.edu/~hmg24/index.html

Sunday, February 12, 2012

Week 6: Reading Notes

1) Tyson, Jeff. http://computer.howstuffworks.com/internet-infrastructure.htm/printable

-No one owns the Internet.

-The Internet grew from four host computers in 1969 to tens of millions computers today.

-The Internet Society formed in 1992 to develop policies and protocols to define our actions online.

-Computer Network Hierarchy

-Domain Name System (DNS) was created by University of Wisconsin in 1983 to map text names to IP addresses, so a user only needs to remember the website name--not its IP address.

-URL: Uniform Resource Locator

-HTTP: Hypertext Transfer Protocol

2) Andrew K. Pace “Dismantling Integrated Library Systems” Library Journal, vol 129 Issue 2, p34-36. 2/1/2004 http://www.libraryjournal.com/article/CA374953.html

-Competitive ILSs
-Starting over, comparing moderate user levels to servicing thousands over the web
-ILS versus "Internet-savvy end users"
-Open Source Software (OSS) as solution?
-"Library systems are changing because library assets are changing..." - Verne Coppi, Endeavor's VP for development

3) Sergey Brin and Larry Page: Inside the Google machine. http://www.ted.com/index.php/talks/sergey_brin_and_larry_page_on_google.html

-"Pretty much everywhere there's power, there's the Internet." - on why there are so many Google queries around the globe, although they need to work on Africa.
-Algorithms are not perfect or smart.
-Everyone has access all around the world? (Noted earlier in lecture that Africa needs help...but so do countries with censorship issues since lecture was made in 2004. Does Google need to update this lecture with digital divide issues?)
-Responsibility to provide people with the right information.
-Never accepts payment for search results, just makes money from advertising.
-Thinks like a "newspaper."
-Tremendous impact on the world.

4) A Few Thoughts on the Google Books Library Project http://www.educause.edu/EDUCAUSE+Quarterly/EDUCAUSEQuarterlyMagazineVolum/AFewThoughtsontheGoogleBooksLi/162510

-The author marvels at the prospect of having so much digital information at our disposal.

-Author's main argument: "Google's initiative will not make books obsolete; it will make the information in them more widely accessible."

-Fascinating, if not true, concept: "Once users have become accustomed to typing in queries, they want to know why they can't find everything in a like manner."

Monday, February 6, 2012

Week 4 Lab

Task 1:

-Find journal impact factor for Annu Rev Inform Sci in the year 2007.

-MySQL command: SELECT * FROM isi_jcr_report_isls WHERE j_abbr = 'ANNU REV INFORM SCI' AND jcr_year = 2007

-Answer: Impact Factor is 1.936

Task 2:

-Find all journals with impact factor > 1 in year 2008.

-MySQL command: SELECT j_abbr, j_if FROM isi_jcr_report_isls WHERE jcr_year = 2008 and j_if > 1

-Answer: See figure below.

Sunday, February 5, 2012

Week 5: Reading Notes

1) Local Area Network: http://en.wikipedia.org/wiki/Local_Area_Network

-Local Area Network = LAN
-Connects computers in a limited area, such as a home, school, or office
-Early LANs used coaxial cables, later models used wi-fi, as less cabling was needed and it was more essential for mobile laptops and phones
-LANs might also be classified as a Metropolitan Area Network (MAN) or Wide Area Network (WAN) depending on distance involved and how connections are established

2) Computer network: http://en.wikipedia.org/wiki/Computer_network

-Computer network: refers to hardware components and communication channels which share information
-Communication protocols: define rules and data formats for exchanging information

Computer network properties:
-faciliate communications
-permit sharing of files, data, and other types of information
-share network and computing resources
-may be insecure
-may interfere with other technologies
-may be difficult to set up

-wired technologies v. wireless technologies v. exotic technologies

Communications protocols:
-Ethernet
-Internet Protocol Suite (or, TCP/IP)
-SONET/SDH
-Asynchronous Transfer Mode
-Network programming

Scale examples:
-Personal Area Network
-Local Area Network
-Storage Area Network
-Campus network

Network topology: layout of the network

Network performance: service quality

3) Coyle, K. (2005). Management of RFID in libraries. Journal of Academic Librarianship, 31(5), 486-489.

"Briefly, the RF in RFID stands for “radio frequency”; the “ID” means “identifier.” The tag itself consists of a computer chip and an antenna, often printed on paper or some other flexible medium. The shortest metaphor is that RFID is like a barcode but is read with an electromagnetic field rather than by a laser beam. The similarity ends there. RFID is an advanced technology compared to barcodes" (486)

-hundreds of different RFID tags on the market today, not a singular technology
*card swipes
*farm animal trackers

Should libraries used RFID?
-Privacy is a large concern
-Technology isn't highly secure
-Best to debate pros and cons and develop policies should RFID becomes a technological standard in the future

Monday, January 30, 2012

Week 3 Lab

http://www.citeulike.org/user/heatherguenther

Sunday, January 29, 2012

Week 4: Reading Notes

1) Database: http://en.wikipedia.org/wiki/Database

Databases - organized collections of data, referring to the data itself, not a database management system (DBMS)

Popular DBMSs: Oracle, Microsoft SQL Server, My SQL, and SQLite

Database languages: data definition languages (DDL), data manipulation languages (DML), query languages

Popular database language: SQL - which combines functions of DDL, DML, and query languages

Relational Model
-applications should search for data by content, not by following links
-somewhat limited depending on data type, relational model venders extended their services to support a larger variety of data types

General-Purpose DBMSs
-not always optimal, when considering certain specialized jobs
-DBMS developers, application developers, database administrators, and application end-users are those involved with a general-purpose DMBS

Examples of Database Types:
-Active database
-Cloud database
-Data warehouse
-Distributed database
-Document-oriented database
-Embedded database

Data Models: provides a way to use data structures needed to model an application
-Hierarchical
-Network
-Relational
-Entity-relationship

Database Architecture: may be considered an extension of data modeling.
-External level - how each end-user understands data organization
-Conceptual level - takes all external views and organizes into one coherent view
-Internal level - concerned with database implementation

Database Security
-Access control
-Data security
-Database audit

DBMS Architecture: specifies components and their interfaces
-external interfaces
-database language engines
-query optimizer
-database engine
-storage engine
-transaction engine
-DBMS management and operation component

Database Transactions
-All transactions obey the following rules: ACID (Atomicity, Consistency, Isolation, Durability)

2) Entity relationship model in database: http://en.wikipedia.org/wiki/Entity-relationship_model

Entity relationship model (ER) - abstract and conceptual representation of data
-ER diagrams are drawn with “rectangles to represent entities, and diamonds to represent relationships.”

Semantic Model - a model of concepts

Crow’s Foot notation - boxes (instead of rectangles) and lines (instead of diamonds)

3) Database Normalization Process: http://www.phlonx.com/resources/nf3/

Database Normalization Process:

Three forms to memorize:

No repeating elements or groups of elements

No partial dependencies on a concatenated key

No dependencies on non-key attributes

Sunday, January 22, 2012

Week 3: Reading Notes

1) Anne J. Gilliland. Introduction to Metadata, pathways to Digital Information: 1: Setting the Stage http://www.getty.edu/research/conducting_research/standards/intrometadata/setting.html

-Metadata is "data about data."
-Metadata isn't a familiar a term to basic users, although they "are increasingly adept at creating, exploiting, and assessing user-contributed metadata such as Web page title tags, folksonomies, and social bookmarks."
-All information objects have content, context, and structure.

Data structure = MARC
Data value (controlled vocabularies) = LCSH (Library of Congress Subject Headings)
Data content (cataloguing rules) = AACR, RDA
Data format/technical interchange (manifestation of a data structure) = MARC21, MARCXML

-Context is important to archivists, especially in a museum setting.
-More to metadata than description and resource discovery? (i.e. exhibition catalogs, acquistion records, licensing agreements, educational metadata)
-user-created metadata, folksonomies

Different types of metadata:
-Administrative
-Descriptive
-Preservation
-Technical
-Use

Why is metadata important?
-Accessibility
-Retention of context
-Expanding use
-Learning metadata
-Legal issues
-Preservation

2) Eric J. Miller. An Overview of the Dublin Core Data Model http://dublincore.org/1999/06/06-overview/

-Dublin Core Metadata Iniative (DCMI) strives for consensus for discovery-oriented descriptions across disciplines.

DCMI requirements:
-Internationalization
-Modularization/Extensibility
-Element Identity
-Semantic Refinement
-Specification of controlled vocabularies
-Identification of structured compound values

3) Working with Endnote, http://www.hsl.unc.edu/Services/Tutorials/ENDNOTE/intro.htm

-bibliographic software program
-select a reference library
-choose different citation styles
-sort, find and view references
-Cite While You Write (CWYW) is an Endnote feature accessed in Microsoft Word. You can insert citations at any time during the writing process.
-Instant Formatting

Sunday, January 15, 2012

Week 2: Reading Notes

1) Computer Hardware: http://en.wikipedia.org/wiki/Computer_hardware and Computer Software: http://en.wikipedia.org/wiki/Software

Mini Computers

-middle range of computer systems (between large multi-user systems such as mainframe computers and small single-user systems such as microcomputers or personal computers)

-also known as “mid-range computer”

-in 1960s, minicomputers usually took up the space of one or two refrigerators, compared to a mainframe system taking up an entire room

-”The first successful minicomputer was Digital Equipment Corporation’s 12-bit PDP-8, which cost from US$16,000 upwards when launched in 1964.”

-In the 1990s, change began to move from minicomputers to inexpensive PC networks.

-Microsoft Windows, beginning with Windows NT, supported multitasking and other features required for servers

-“…although today’s PCs and servers are clearly microcomputers physically, architecturally their CPUs and operating systems have evolved largely by integrating features from minicomputers.”

Personal Computers

-Personal Computers, or PCs, are meant for the average end user

-Software on PCs include: word processing, spreadsheets, databases, Web browsers, e-mail clients, digital media playback, games, etc.

-Early PC users had to write their own programs to do anything with their machines.

-"Since the early 1990s, Microsoft software and Intel hardware have dominated much of the personal computer market, first with MS-DOS and then with the Wintel platform. Alternatives to Microsoft's Windows operating systems include Apple's Mac OS X and the open-source Linux OSes. AMD is the major alternative to Intel's central processing units.”

-"In July and August 2011, marketing businesses and journalists began to talk about the 'Post-PC Era', in which the desktop form factor was being replaced with more portable computing such as netbooks, Tablet PCs, and smartphones.”

History

-"In what was later to be called The Mother of All Demos, SRI researcher Douglas Engelbart in 1968 gave a preview of what would become the staples of daily working life in the 21st century - e-mail, hypertext, word processing, video conferencing, and the mouse. The demonstration required technical support staff and a mainframe time-sharing computer that were far too costly for individual business use at the time.”

-Early PCs, or microcomputers, were mostly of interest to hobbyists and technicians. “Practical use required adding peripherals such as keyboards, computer displays, disk drives, and printers.”

-In 1982, “The Computer” was named Machine of the Year in Time Machine.

Hardware

-Mass-market computers use standard components and are easily assembled.

-“A typical desktop computer consists of a computer case which holds the power supply, motherboard, hard disk and often an optical disc drive. External devices such as a computer monitor or visual display unit, keyboard, and a pointing device are usually found in a personal computer.”

-"The motherboard connects all processor, memory and peripheral devices together. The RAM, graphics card and processor are mounted directly onto the motherboard. The central processing unit microprocessor chip plugs into a socket. Expansion memory plugs into memory sockets. Some motherboards have the video display adapter, sound and other peripherals integrated onto the motherboard. Others use expansion slots for graphics cards, network cards, or other I/O devices. Disk drives for mass storage are connected to the mother board with a cable, and to the power supply through another cable. Usually disk drives are mounted in the same case as the motherboard; formerly, expansion chassis were made for additional disk storage.”

Mass storage

-Floppy drive…zip drive…optical drive…different methods of storage appeared as memory sizes increased.

-USB drive is typically the external storage device used today

-“The operating system (e.g.: Microsoft Windows, Mac OS, Linux or many others) can be located on any storage, but typically it is on a hard disks. A Live CD is the running of a OS directly from a CD. While this is slow compared to storing the OS on a hard drive, it is typically used for installation of operating systems, demonstrations, system recovery, or other special purposes."

Operating System

-“An operating system (OS) manages computer resources and provides programmers with an interface used to access those resources.”

-Common OSs: Microsoft Windows, Mac OS X, Linux, Solaris and FreeBSD.

-“Unix was developed at Bell Labs beginning in the late 1960s and spawned the development of numerous free and proprietary operating systems.”

-“Linux is a family of Unix-like computer operating systems. Linux is one of the most prominent examples of free software and open source development: typically all underlying source code can be freely modified, used, and redistributed by anyone.”

Computer Architecture

-"In computer science and engineering, computer architecture is the practical art of selecting and interconnecting hardware components to create computers that meet functional, performance and cost goals and the formal modeling of those systems.”

-“The noun computer architecture or digital computer organization is a blueprint, a description of the requirements and basic design for the various parts of a computer. It is usually most concerned with how the central processing unit (CPU) acts and how it accesses computer memory. Some currently (2011) fashionable computer architectures include cluster computing and Non-Uniform Memory Access.”

Performance

-“In a typical home computer, the simplest, most reliable way to speed performance is usually to add random access memory (RAM). More RAM increases the likelihood that needed data or a program will be in RAM. So, the system is less likely to need to move memory data from the disk. The disk is often ten thousand times slower than RAM because it has mechanical parts that must move to access its data.”

Operation
-CPUs execute instructions called a program. The four steps of operation are typically: fetch, decode, execute, and writeback.

Computer Data Storage

-core function of computers

-primary storage (internal memory)

-secondary storage (external memory)

-off-line storage (data that is recorded in a secondary device, then disconnected from the machine)

Input/Output

-“In computing, input/output, or I/O, refers to the communication between an information processing system (such as a computer), and the outside world, possibly a human, or another information processing system. Inputs are the signals or data received by the system, and outputs are the signals or data sent from it."

http://en.wikipedia.org/wiki/Software

-Computer software: a collection of computer programs and related data

-Alan Turing, in 1935, proposed first theory about software

-"Colloquially, the term is often used to mean application software. In computer science and software engineering, software is all information processed by computer systems."

-System software are usually basic functions for average computer use, assisting the computer to function. Includes: device drivers, operating systems, and utilities.

-Programming software assists programmers in writing computer programs. Includes: compilers, debuggers, and text editors.

2) Scanners and Digitization: Stuart D. Lee. Digitization: Is It Worth It? http://www.infotoday.com/cilmag/may01/lee.htm

-“…the real cost of digitizing and delivering a printed black-and-white, letter-sized page at 300 dpi 1-bit could be as high as 54 cents.”

-“…in summary the listed advantages offered by digitization tend to come under the headings of increasing access, preservation, and meeting strategic goals (i.e., raising the profile of the institution running the project, and so on)”

-“Above all we should not be forgetting that our primary aim is to meet the requirements of the readers and to provide them with the resources they really need to use.”

**The debate on digitization is ongoing, but the author’s last point—considering the readers’ needs—is sound advice for the future of the field.

3) Doreen Carvajal. European libraries face problems in digitizing. New York Times. October 28, 2007 http://www.nytimes.com/2007/10/28/technology/28iht-LIBRARY29.1.8079170.html

-lack of funding for digitization

-considering different business models to pay for costs

-possibility of entering into a contract with Google for digitization projects

-a feeling of hesitancy among libraries and museums for entering into contracts with corporations, in order to protect full public access…the downside is that money is still needed to continue public access

4) Data Compression. http://en.wikipedia.org/wiki/Data_compression

-encode with fewer bits than the original file would use

-“Compression is useful because it helps reduce the consumption of expensive resources, such as hard disk space or transmission bandwidth. On the downside, compressed data must be decompressed to be used, and this extra processing may be detrimental to some applications.”
-Audio compression reduces bandwidth of digital audio streams and the storage size of audio files.
-Video compression: most is lossy, DVDs use a standard coding called MPEG-2

-“Most video compression is lossy: it operates on the premise that much of the data present before compression is not necessary for achieving good perceptual quality. For example, DVDs use a video coding standard called MPEG-2 that can compress video data by 15 to 30 times, while still producing a picture quality that is generally considered high-quality for standard-definition video.

Tuesday, January 10, 2012

Intro to Information Technologies

Stay tuned for some exciting class notes!