Tag Archives: analytics

Statistics from Media Lab and Reference

  • Media Lab:  Has no stats that would be of use to us
  • Reference:  Has no stats that would be of use to us, but when I asked Bryan about ref stats he suggested EZProxy stats, so…
  • EZProxy:  Has no stats that would be of use to us YET.  Later, we should consult these.  They can tell us about subscription database use, including what databases are most commonly used and how many of our database users log in from on-campus vs off-campus.  They can tell us more than that too, but to get more we might need help from someone like Ian!  Barbara can explain better than I can.  Mike Lynch will be a good resource too.  If you want a look, go to: \orgs\LIS\LISstaff\ILS III Millennium User Materials\EZProxy statistics\+START HERE 2009.html

HEAT – gathering statisitics that can enlighten us

I have begun an attempt to query HEAT for any useful statistics that could aid us in our mission.  Since HEAT has a unique way of structuring its data in its semi-relational model it has been challenging to pull helpful information.  One primary reason for this is that the important information we would want to know about (e.g. sites, problem/request details, etc…) are in free-text fields which cannot be queried easily.  Another obstacle is that there seem to be a variety of methods currently (historically) used to categorize Web Site specific events.  I spent a few hours this week getting to know the database structure and I should be able to provide some statistics (qualified) by early next week.  If anyone has suggestions on the type of information/stats we could pull please send them my way.  For now I will look for the obvious stats and we can drill into HEAT further as we discover more about HEAT’s dB and where the data is stored.

 

Google Analytics for LIS

Before I begin with the data, I have to explain for website redirection works. We have a number of pages on the CMS that redirect the user to another page. An example is the link to MIDCAT on the LIS home page. The redirection is done by sending what is called an HTTP Header from the web server to the client. Basically, all web responses from a server come in two parts: the headers, which supply some metadata to the client about their request, and the body, which contains the content. When a server instructs a client to go to another location, via a redirect, it might send a header of type 302 and the location of the new site. The client then makes a request for the new site. Importantly, this is all done before the body of the original site is loaded by the client.

This is important because of the way Google Analytics works. The code to update the analytics data for the site is contained at the end of the body and is only run if the entire page loads. This means that when we do a redirect, the user’s click on that link is not recorded by our analytics tools. Because of this, you will see a 0 next to that link to MIDCAT on the LIS home page. It would be silly of us to assume that this link is never clicked on, but according to GA, it wasn’t. I’ve noted in the spreadsheet at the end of this post whenever a link on a page is a redirect. We could use standard log file analysis to determine the exact number of page visits to all of these, but as I explained earlier, we don’t have the log files because I deleted them all.

This is not to say that we should discount the results of the data here. There is still useful information in GA. However, when looking at the data, we need to take into account that there are links on each page for which we don’t have information.

Now on to the analysis! The following data is for the last year (July 26, 2008 – July 26, 2009). I figured that a full year would give us the most robust view of the site traffic.

I’ll start with the total page views for each major area of the LIS site. Unsurprisingly, the Library content is the most viewed, with the Contact, Quick Links, and Teach/Learn sections barely viewed at all in comparison. The Unique Pageviews bear out this pattern and the full table of data is available in the spreadsheet at the end of the post.

Two questions we agreed to ask of the Google Analytics tool were “What are the top five links on the LIS home page?” and “What are the five least clicked links on the LIS home page?” Here are the top five:

  1. Library & Information Services (6539)
  2. Articles, Indexes, Research Sources (6160)
  3. Find Books, Articles & More (4289)
  4. Middlebury (Home Page) (3175)
  5. Course Reserves (3059)

Here are the five least clicked:

  1. Content Provider (LIS website) (0)
  2. P2P Music & Video Filesharing (0)
  3. Telephone Services (0)
  4. Digital Media Development (2)
  5. Departmental & Workgroup Directories (3)

The full lists, as well as the lists for the landing page of each sub-site are contained in the spreadsheet at the end of this post.

One question that I was not able to answer was about click paths to the blogging and wiki sites. This question can’t really be answered for a couple reasons. First, we don’t have a full year of data for those other services and, second, links to those services would go through redirect posting which are flawed for the reasons described above. If there are sites within the LIS site that you would like to see click-path analysis for, please let me know specifically what resources we should examine.

Here are the broad recommendations I draw from this analysis:

  1. The Quicklinks section gets little traffic. This should be removed from the site IA. A better designed site with the addition of a search landing page should not need this section.
  2. Other than the link to the CTLR, the Teaching & Learning site is not used. Rather than try to have this information live in multiple places, any relevent content should be moved from here to CTLR and this sub-site should be replaced with a simple link to CTLR.
  3. The Hours, Locations, Maps section is really the only think people click on About LIS to see. We should have this information on the LIS search landing page and move the remainder of the content elsewhere.
  4. The Departmental Directory is not used, but the full Staff Directory is used. Eliminate both and replace with a link to the LIS search results in the central Directory, which is the same information as the Staff Directory. (Forthcoming improvements to the central Directory interface will improve this experience).
  5. The Quick Phone and Email Contacts is the primary resource in Contact LIS. Eliminate the rest of the sub-site and add this information to the LIS search landing page.
  6. The links in the top sections of the sub-site landing pages such as below “Need help?” on the Library site are rarely clicked. Avoid this interface in future site designs.
  7. The “Did you know…?” sections of the LIS site are rarely visited. Consolidate this information into a central blog about LIS.

Here is the complete spreadsheet of results.

Survey results – summary

How important is it for you to reach each of these audiences with your web content?

Students, faculty, LIS staff = “very important”
Other staff = “quite important”
Alumni, other = “less” to “unimportant”
________________________________________________________________
How often do YOU use these LIS website features?

Often: catalog

Sometimes: helpdesk documentation, LIS staff directory, Newsletters, LIS blogs, LIS Wiki

Never: subject guides

Didn’t know existed: NONE! good.
_______________________________________________________________
What features of the LIS website work or could be improved?

Work: research guides

Need improvement: help documentation, website searching, LIS org structure, LIS depts staffing info, services/depts loctions, hours of service, space availability, who to contact for what

Unfamiliar: staff accomplishments, status of systems, LIS events calendar, tagging

3-way tie: emergency procedures
_________________________________________________________________
Do users often contact you for information that is available on the website?
yes = 65%  e.g. hours, documentation.
_________________________________________________________________
Is there information not on the LIS Website that would be useful to include?
yes = 58% (22 of 42 responses)- no theme of what’s missing is apparent.
Some comments on difficulty of finding what’s there already.
__________________________________________________________________
What is your level of comfort with the following technologies?
Most to least comfortable, overall –
Blogs, Wikis, IM, CMS, RSS, Surveying
___________________________________________________________________
What specific areas of the current site do you feel are successful? Why are they successful?
MIDCAT gets a few mentions. Documentation. Some comments apply to College website not LIS.
_________________________________________________________________
Is there anything else you’d like the LIS Website Team to know about?

22 responses – most common theme – need to improve access to info already there! “info is buried” “difficult to find things” etc.
____________________________________________________________________

Please rate your overall experience in navigating the LIS website.

Frustrating = 52%  OK = 48%  Love it = none.

Should be possible to show improvement on this metric.

Search Statistics from the GSA

The Google Search Appliance lets us create “collections” of portions of the site that can be searched. These collections are what you see in the drop-down field on http://search.middlebury.edu. The LIS collection is also the one being searched if you enter a query on the LIS or Library home pages on the CMS. This collection is searched much, much less frequently than the main search, however we may find the results interesting.

Here are the top 15 queries of the LIS search collection in the last year:

jstor 51
tigercat 15
eres 11
psychinfo 10
special collections 9
mla 8
citation 8
JSTOR 8
oxford english dictionary 8
lexis nexis 7
thesis 7
printing 7
library hours 7
music library 7
segue 6

And here is the same for the LIS Wiki collection:

novell 2
server 2
tigercat 2
computer upgrade schedule 1
inurl:pdf 1
proprietary name 1
freeze 1
video 1
antivirus software 1
Novell, Tigercat 1
How to use Novell and Tigercat 1
wordpress 1
blog 1
after graduating 1
Controller’s Office 1

For comparison, and so you can see how infrequently the LIS search collection is used, here are the top 15 LIS-related queries from the “All” collection, which as you might expect indexes all the other collections:

segue 1152
library 388
email 306
tigercat 246
webmail 214
bannerweb 163
library hours 129
INB 119
jstor 108
inb 103
eres 102
banner web 83
netstorage 77
banner 72
computers 71

(Note on this last one: midcat is the 16th term with 70 queries).

You can see the full reports for each, which include more information at these on-campus-only links:

Report for LIS Collection 07/24/2008-07/24/2009

Report for LIS Wiki Collection 07/24/2008-07/24/2009

Report for All Collection 07/24/2008-07/24/2009

I also sent an email to Chris Norris asking for assistance getting us some information from Google Analytics. He was out yesterday and today, but appears to be in his office next week, just booked straight through with meetings. I’ll keep you posted on this item. Here is the list of questions I sent him:

1. What are the top 5 search terms within the LIS? (I’ll get this from the GSA)

2. What are the top 5 pages on the LIS site?

3. What is the most common click path from /academics/lis to the LISt blog and the LIS Wiki?

4. What links on /academics/lis (the landing page) are clicked on the least?

5. Same as (4), but applied to /academics/lis/lib, /academics/lis/help, and /academics/lis/about.

Recommendations: Preparing a Document to Share

I started the LIS Website Recommendations document a few weeks ago, based on my personal views of the project and some of the discussions we have had early on during team meetings. I will commit to working later in this week to flesh this document out more with specific references to materials which support the changes it recommends. However, here are some framing questions we can use to think about these recommendations.

What sections of this document conflict with goals of the team?

What goals of the team are missing from this document?

What changes do we need to make to these recommendations based on the survey results?

What changes do we need to make to these recommendations based on other data sources?

What resources should we reference within this document to support the recommendations it proscribes?

Criteria: What do we need for a LIS website?

This is a place to coordinate discussion about the criteria we desire for the LIS site. We’ve created two sections in the wiki to store this information:

How can we combine these sections? Do we need to combine these sections?

What information from our own personal views are missing from these sections?

What information from our survey results is missing from these sections?

What information from the other analytics and analysis we’ve conducted is missing from these sections?

What information in these sections should be removed or is no longer applicable knowing what we know now?

How do these criteria allow us to meet our goals and achieve our vision?