December 10

Print Code and Citation Information Data Share

As a follow-up to yesterday’s post about copyright issues with state codes, the data that I collected on print versions of state codes can be found on this Google Drive spreadsheet.

I also went through all of the states’ citation rules, because I was curious about requirements to cite commercial vs. public domain works, as well as whether states require use of The Bluebook (a proprietary citation system) itself. So that data is also in that spreadsheet. My main takeaways from that are:

  • 11 states require use of the Bluebook (in whole or part)
  • 22 states specifically require citation to West National Reporter System cases (instead of using the cite you can find, for example, via Google Scholar)
  • 16 states use public domain citation formats, although many also require parallel cites to West NRS cases.  Of course, anything before the change over to Public Domain will require cite to a non-vendor or media neutral source.

I want to go back through the citation rule information and see how many states have their own citation manual, because it could be inferred that those that don’t are also Bluebook by default.  Although I did see instances of “any accepted citation format” as a guide, so in theory they could use Peter Martin’s free and open Basic Legal Citation.  I also want to get an exact count of how many of the Public Domain states really do require citation to commercial cites.  Sort of kicking myself that I didn’t think to keep track when I was doing it, although with only 16 states, it should be pretty easy to accomplish.

December 9

Copyright Issues with State Codes

Hello, Gentle Reader! Long time no post! I’m in the process of writing up my research results from my survey/census of state published legal information. You can get a preview of my findings in my recent Slaw.ca column “What Do You Mean the Law is Closed?” or this slide show that illustrates that post with some of the data included. I didn’t make that slide show explicitly to go with the post – I’ve been traveling and I presented on my research using that deck.

One of the topics that I’m covering in my coming research report is the copyright…well, confusion, frankly…that exists with state codes. There are so many copyright notices on state legal information webpages! It’s not entirely clear if they mean the legal info content, or if it’s just something always stuck in the boilerplate of the webpage and they don’t really mean it, or something in between. Although I’m generally trying to stick to web-based publishing of law, I thought that before I left the HLS Mothership for the holidays, I’d check out their print collection of codes and see what the situation was there, in hopes that would clear some things up.

SPOILER ALERT. IT DID NOT CLEAR THINGS UP FOR ME.

It did, however, provide some fascinating data points. For various definitions of the word ‘fascinating.’

For “official” print codes, I found the following numbers:

  • 4 – No Claim of Copyright by anyone
  • 22 – State Claims Copyright
  • 10 – Thomson Reuters (or some subsidiary thereof) Claims Copyright
  • 9 – LexisNexis (or some subsidiary thereof, usually Matthew Bender) Claims Copyright
  • 3 – Shared Claim of Copyright between State and Publisher

Sharp-eyed readers will note that this only adds up to 48 codes. That’s because some states have designated their online code to be official and some states have two official codes.

BUT HERE’S WHAT I FOUND TO BE REALLY INTERESTING….

Most of the codes are annotated, so it’s entirely possible that the claim of copyright is referring to the annotations. Whether or not that is kosher is currently being decided in State of Georgia v. Malamud. BUT BUT BUT… Eight codes – Connecticut, Idaho, Minnesota, Nebraska, Nevada, South Carolina, South Dakota, and Washington – are UN-annotated and yet there still is a state claim of copyright slapped on them. So, unless I’m making a crazy assumption here, that means that these states are claiming copyright on something that is a clear edict of government (and thus public domain).

Harvard has more than just the official codes in their collection, so in for a penny, in for a pound and I went through 64 state codes in total.  The numbers for them ended up being:

  • 4 No Claim of Copyright by Anyone
  • 23 State Claims Copyright
  • 21 Thomson Reuters (or some subsidiary thereof) Claims Copyright
  • 13 LexisNexis (or some subsidiary thereof) Claims Copyright
  • 3 Shared Claim of Copyright between State and Commercial Publisher.
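
For anyone who wants to double-check my arithmetic, here is a minimal Python sketch of the two tallies. The category labels are my own shorthand, and the "extra" breakdown assumes the official codes are a subset of the 64 I looked at in total.

```python
# Counts copied from the two lists in this post; labels are shorthand.
official = {
    "no_claim": 4,
    "state": 22,
    "thomson_reuters": 10,
    "lexisnexis": 9,
    "shared": 3,
}
all_codes = {
    "no_claim": 4,
    "state": 23,
    "thomson_reuters": 21,
    "lexisnexis": 13,
    "shared": 3,
}

official_total = sum(official.values())
all_total = sum(all_codes.values())

# Assumption: the official codes are a subset of the full set, so the
# difference per category is the count of additional (unofficial) codes.
extra = {label: all_codes[label] - official[label] for label in official}


def shares(counts):
    """Return each category's share of the total, as a rounded percentage."""
    total = sum(counts.values())
    return {label: round(100 * n / total, 1) for label, n in counts.items()}


print(official_total, all_total)
print(extra)
print(shares(official))
print(shares(all_codes))
```

Under that assumption, nearly all of the additional codes carry a commercial publisher's notice rather than a state's.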

Like I said, it’s confusing to know exactly what these copyright notifications are claiming copyright on. Annotations, Section Headings, the text of the law itself… ¯\_(ツ)_/¯ However, in 3 of these codes, there was a disclaimer from the publisher (Matthew Bender each time) that they were not claiming copyright in the statutes, case quotes, etc., just the annotations. So that was nice. And in one of the codes, the publisher (Thomson Reuters) said that they were only claiming copyright in the annotations and that the state had copyright in the statutes. So, score one for being clear, I guess…

And finally, in “things I didn’t realize I had to be annoyed about”, I found that Thomson Reuters (or some subsidiary thereof) fairly often had a trademark on names such as “Iowa Code Annotated.”  So if, for example, I wanted to publish my own annotated copy of the Iowa Code, I guess I would run into trademark issues with finding a clear name for it?   I’m honestly not an IP expert, so I need to think and research more about that, but on first pass/gut instinct, I thought there were some rules about trademarking common-ish names.

 

September 4

Activity Summary – Week Two

Made it to Friday of my first full week of work!  Yay me!   So what have I done this week?

I made it through the first pass of reviewing the online codes of all 50 states (and DC). My goal was to have all of the state code data collection project done this week, so I’m pretty much on schedule. Yay me.* About halfway through I decided I needed to have separate entries for the actual name of the code and the name of the official/certified version of the code. (Because they’re not the same thing. Of course they’re not.) I hope to fill that in, as well as a few unknowns in my datasheet, by the end of business today. That means I get to spend today in the Harvard Law Library Reading Room using the AALL State Bibliographies, which will hopefully help solve some mysteries for me.

After I get everything sorted I’ll make it all public. I really want to jump into the data collection on case law, so any in-depth analysis of the code information will probably wait. I can say now that one way my thinking has changed is that – for some reason – I used to think that only state websites could/should be the publisher of official digital copies of law. However, something about seeing the number of privately published print official/certified codes has caused me to change my mind. Why can’t a State Decoded website be certified too?

(I have vague ideas about how blockchain technology could be implemented to ensure absolute accuracy and chain of custody with regards to code data without relying upon PDFs. But that’s another post for another day. And I’m sure there are other ways of doing this. Hey hey, who’s got a research project and a couple of months to spare doing research? This gal right here. But I digress.)

Speaking of AALL, I was reminded this week of the Digital Access to Legal Information Committee.  They are the ones in charge of the 50 state surveys, which I am modelling my data collection on.  Yay for more people being interested in this type of information and future possible collaborators.

*It’s interesting being in charge of my own schedule and deadlines. I like it. Surprisingly, no guilt thus far about not working hard enough or fear about setting the bar too low. I think maybe it’s because it wasn’t that long ago that getting out of bed and getting dressed constituted a victory for me, so I’m in no rush to set myself up for failure.

Speaking of success, the Berkman stuff will finally start to ramp up next week.  There was an email chain of all the new/returning fellows/affiliates/associates etc introducing themselves.  Holy Impressive Lineup, Batman.  I was hit with a mixture of impostor syndrome and “I don’t think we’re in Kansas anymore, Toto.”  My parents like to kid me about being a librarian rockstar, but there are like, actual rockstars on this list.   And, oh, by the way, US Supreme Court justice coming to talk to us.   It’s intimidating and exciting and I can’t wait to learn more from the community.  I’m still not entirely sure how I am lucky enough to be doing this.

On the skills enhancement side of things, I started to work my way through the Codecademy web stuff. All refresher thus far and pretty easy. I was told by several people, quite understandably, that instead of asking what programming language I should learn, I should have a project in mind and build it and that will teach me what language to learn. But that doesn’t quite answer the question about which language will do what I want – I don’t want to start building something in, say, Ruby, only to find that actually Python would have been better. Does that make sense? At any rate, I think I’m going to start with the Python modules on Codecademy.

I also want to highlight this blog post I read this week:  Libraries’ Tech Pipeline Problem.  Definitely reinforced my thinking that I have a great opportunity here to teach myself some new technical skills.

On a personal note, settling into Cambridge nicely. My apartment – a small fully furnished one – is finally starting to feel a little like home/my space, even though I’m surrounded by other people’s stuff. I’m also starting to get some routines in place, which are always comforting. I’m definitely learning to embrace simplicity, just due to the size of my apartment and the fact that I couldn’t bring much stuff with me. It’s nice. I guess I could insert a reference to local boy Thoreau here, but I’ll pass on the temptation. My plan for the weekend is to get the heck out of Boston one day – probably Salem because I know I definitely want to see that and soon it’ll be too close to Halloween to deal with – and maybe one day go downtown and ride a double-decker bus tour just to get the lay of the land. I’ve been to Boston quite a few times in the past few years, but it seems like all I saw was either the convention center or Harvard Law School.

August 28

Codified State Statutes

First things first…I’m going to try and get my arms around what the current crop of law published online by the creating government looks like.    Seems simple enough, right?    Actually, I don’t think it will be too bad.  Just tedious and, if the few websites I looked at today are any indication,  heartbreaking.

I’m going to stick initially with just basic primary state level material – codes, case law and regulations.  (Although with case law, I think I may go county by county – all 3143 of them – so that for later data manipulation I can get everything with the correct state and federal jurisdictions.  Because I just know even without looking into it that federal and state judicial districts will not overlap nicely.  BUT ANYWAY.. enough about cases for now. )  At this state level investigation,  I’m also going to include DC and the inhabited territories, so that brings us to 56 geographic entities for codes and regulations.

So what are the data points I’m going to collect?  I think I came up with a good balance of COLLECT ALL THE DATA and things that are actually important to know.  Since I don’t have a final product firmly designed for this outside of using it for research and comparison, I may have erred on the side of “collecting too much” but if I’m going to spend time on this, I want to make sure that I’m Doing It Right.  As you can probably tell, my list has also been greatly informed by the recent problems with the Georgia State code.    Here are the data points I’m going to collect for codes.

  • Name of official version of code
  • Is official version annotated or unannotated?
  • Publisher of Official Code
  • Online version URL
  • Is online version official?
  • Statement of officialness?
  • Format of Online Version
  • Publisher of Online Version
  • Use restrictions on Online version
  • Statement of Copyright
  • Current year only or archival versions available online?
  • Code archive or session law archive?
  • URL of Archival versions
  • If archival versions available, how far back?
  • Bulk download available?
  • Search capabilities – none, basic, advanced/filtered

Am I missing anything?
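
To make the list concrete, here is a rough Python sketch of how one row of the datasheet might look as a record. Every field name, the `SearchLevel` enum, and the `missing_fields` helper are hypothetical names made up for illustration; the real data lives in a spreadsheet, not in code. Each data point defaults to `None`, meaning "not yet known".

```python
from dataclasses import dataclass, fields
from enum import Enum
from typing import List, Optional


class SearchLevel(Enum):
    """The three search options named in the list above."""
    NONE = "none"
    BASIC = "basic"
    ADVANCED = "advanced/filtered"


@dataclass
class CodeRecord:
    """One jurisdiction's row; all field names here are hypothetical."""
    jurisdiction: str  # state, DC, or territory (not itself a list item)
    official_name: Optional[str] = None
    official_is_annotated: Optional[bool] = None
    official_publisher: Optional[str] = None
    online_url: Optional[str] = None
    online_is_official: Optional[bool] = None
    officialness_statement: Optional[str] = None
    online_format: Optional[str] = None
    online_publisher: Optional[str] = None
    use_restrictions: Optional[str] = None
    copyright_statement: Optional[str] = None
    has_archival_versions: Optional[bool] = None  # False = current year only
    archive_type: Optional[str] = None  # "code" or "session law"
    archive_url: Optional[str] = None
    archive_earliest_year: Optional[int] = None
    bulk_download: Optional[bool] = None
    search: Optional[SearchLevel] = None

    def missing_fields(self) -> List[str]:
        """Names of the data points still unknown (None) for this record."""
        return [f.name for f in fields(self) if getattr(self, f.name) is None]


# Placeholder record, not real survey data.
record = CodeRecord(jurisdiction="Example State", bulk_download=False)
print(record.missing_fields())
```

Keeping "unknown" (`None`) separate from "no" (`False`) is the one design choice that matters here: it makes it easy to list what still has to be looked up for each jurisdiction.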