IRC log for #koha, 2006-02-18

← Previous day | Today | Next day → | Search | Index

All times shown according to UTC.

Time	Nick	Message
12:39	jungle	hello people
13:34	owen	kados: is the plan to create an opac system preference for color stylesheet like the intranet has?
13:52	kados	owen: yep
13:52		owen: the syspref's already in rel_2_2
13:52		owen: not sure if support for it is built in to the scripts yet
13:56		hey thd
13:58	dweezil19	can i ask a quick question about koha?
14:04	kados	dweezil19: sure
14:20	dweezil19	Is it possible to authenticate via active directory?
14:56	kados	dweezil19: hypothetically, yes
14:56		dweezil19: Koha has support for LDAP
14:57		dweezil19: can you use that with active directory?
14:57	dweezil19	i guess so
14:57		it is an ldap server so that should work (i think!)
15:00		thanks, I've got to run now
15:19	dce	has anyone here dealt with importing MARC data from Follett when ordering books from them? (Not migrating from Follett software.)
15:24	thd	kados: I am awake again, are you?
15:24	kados	thd: yes, but I have a conference call soon I must prepare for
15:25	thd	kados: after your call would be when?
15:25	kados	dce: could you expand on what you mean by that?
15:25		thd: probably around 3:00 or so
15:25		thd: I'll ping you when I am available to talk
15:26	thd	kados: I await your ping
15:26	kados	dce: if you mean integrating the order process within Koha's acquisition module, noone has done that to date
15:26		dce: but if we had a sponsor for such a feature it could be done very quickly
15:26		dce: if you'd like to talk more about it later, let me know
15:27	dce	kados, when ordering books from Follett - http://www.flr.follett.com/ - our library staff has the option of getting a disk with MARC records.
15:27	thd	dice: Do those records cost extra?
15:27	dce	We are looking at using Koha in a new school opening this fall and they want to know if they will still be able to import that info.
15:27	kados	dce: (In that case, you can import MARC directly into Koha from the disk ... )
15:27	dce	thd, I can ask
15:28	kados	dce: (though it may require some customization of the import file)
15:28		dce: to map your holdings data to Koha's internal structure for holdings)
15:29	dce	apparently they can provide the records in more than one format. I'll get more details
15:30	thd	dice: MARC records may always be imported at any time but you are not dependent upon Follett records. You could also copy catalogue over Z39.50.
15:32		dice: I am interested in knowing what the price differential is between purchasing books without records supplied and purchasing with records supplied.
15:32	dce	thad: copy catalogue from where over Z39.50? I'm afraid that I have very little experience with the library side of things but a lot on the tech side.
15:34	thd	dice: Any other library with a Z39.50 server, which maybe any other library from a neighbour to the national library in another country.
15:34	dce	thad: cool
15:35		thanks.
15:36		I'll lurk here for a while waiting for the librarian to give me more details then I'll let you know.
15:36	thd	thd: There may still be the advantage of convenience, known quality standard, and actually having a record that is hard to find elsewhere by obtaining records from you vendor.
15:37		s/you/your/
15:40		dice: I am interested in knowing how that value is priced or if it is unavoidably built into the service. How does the service price differ from the publisher's suggested retail price.
15:44		dce: my vision is focused now and I see that you are dce not dice as I had used.
15:44	kados	cancelled taht is :-)
15:44		that even
15:45		too much coffee today :-)
15:46	thd	kados: just say no
15:46	kados	heh
15:46		I don't drink coffee every day
15:47		thd: do you happen to know if it's possible to create frameworks within Koha that store fixed fields differently?
15:47		thd: I haven't played with the frameworks much
15:53	thd	kados: How differently do you want to store the fixed fields?
15:55		kados: I cannot see how anyone drinks coffee ever but I thought it tasted foul when I sipped it once and forever found it easy to say no.
16:28		kados: what does your question mean? Paul does have a plugin for modifying the most used UNIMARC fixed field. UNIMARC 100 $a is a fixed subfield that along with other 1XX works similarly to MARC 21 005-008 fixed fields.
16:29	kados	I just had someone ask about koha's ability to handle fixed-field itemtypes
16:30		material type and the like
16:31	thd	kados: You could write a plugin for those to use in the Koha record editor,
16:32		kados: Look at paul's plugin for UNIMARC 100.
16:34	kados	yea, suppose a plugin would work
16:34	thd	kados: look at the MOD mapping for material types also the complete MARC 21 Bibliographic manual hosted on TLC.
16:40		kados: This table is useful: http://www.itsmarc.com/crs/Bib0020.htm
16:41	kados	yea, that is useful
16:41		I could easily create a framework for each material
16:41		with a plugin that auto-filled the proper fixed field
16:43	thd	kados: I had started writing some Perl code months ago for determining material types from the MARC record but I aborted that effort with the number of fields involved in either general or special material designation. The question is complex and needs to be stored in a separate local use MARC field for Koha 3.0.
16:44		kados: Proper material typing is too inefficient to be done at query time.
16:46		kados: If you want a simple plugin for the material types that table and the related fixed field documentation will help you write a plugin.
16:46	kados	thd: thanks
16:48	thd	kados: The problem shown by that table is that you need more than one fixed field for even distinguishing whether language material is a serial or a book.
16:53	chris	morning rosa
16:53		morning thd and kados
16:54	rosa	morning
16:54		all
16:55		how's the weather in elly?
16:55		Welly, even
16:55		It's very middling here
17:04	thd	good morning chris and rosa
17:04	chris	sorry rosa, middling here too
17:06	rosa	bummer. I thought it was going to be beautiful
17:06	chris	it was yesterday
17:06	thd	kados: The real problems are reading the meaning of the coded and other values. Writing them to the individual fields is is easy for a plugin.
17:06	rosa	here too
17:06	chris	it might still get sunny .. its threatening too
17:09	thd	kados: A sophisticated plugin that added values to multiple fields at once would be nice though, that could be the reverse of a complex general and special material type reading script.
17:14		kados: It is much less work for the cataloguer to do things once for the whole record but much more work for the programmer. Yet, the parallel reverse script for reading is needed for even copy catalogued records.
17:19		kados: A change is eventually needed in how the frameworks function so that plugins can operate on discrete parts of fixed fields that correspond to an individual value. Fixed fields would then be much easier to manage.
17:39	dce	thd: The MARC records from Follett are $0.26/record.
17:44	thd	dce: That is probably a good price to even save you the trouble of sending a query to a Z39.50 server. What does Follett charge for the books themselves in relation to the publisher's suggested retail price?
17:54	jungle	hello
17:54		hi to all hope everything is good
17:54		: )
18:31	kados	thd: you still around?
18:31	thd	yes kados
18:32	kados	did you see stephen balkits latest email?
18:35		thd: after you read it ping me
18:35		thd: i'd like to talk about how best to respond
18:36	thd	kados: still reading
18:46		kados: I have read. I have corresponded with Steven in the past his details about Koha are not current enough sometimes.
18:46	kados	right
18:47		thd: any comments on his statement about the 650?
18:47		thd: what is he referring to?
18:47	thd	kados: He cares greatly about library standards as do I and every one working on Koha should.
18:47	kados	of course
18:47		MARC is a terrible standard
18:48		but believe me, koha developers do care about it
18:49	thd	kados: The problem with MARC is that in order to be a very meaningful standard LC has been very resistant to changes for mere data format fashion.
18:50		kados: Yet, after 4 decades that is a little problematic.
18:51	kados	well ... the problem with MARC as I see it is that it doesn't live up to its name as a machine readable record format
18:51		for instance
18:51		no indexer on the planet
18:51	thd	kados: I looked for a supposed reply from paul but found none.
18:51	kados	can deal with MARCs notion of subfield order
18:52		if $k follows $b it's subtitle if it follows $a it's title
18:52		it's absurd
18:52	thd	kados: full text indexers have no terrible problem.
18:52	audrey	kados: do you know a website about Marc? with lots of details and examples
18:53		I am still searching for one.
18:53	kados	audrey: sure ... just a sec
18:53	audrey	I don't have a card for KnowItNow at the moment
18:53		left the card in US
18:54	thd	audrey: http://www.itsmarc.com/crs/CRS0000.htm
18:54		audrey: http://www.loc.gov/marc/
18:55	audrey	have the loc one already
18:55	kados	audrey: this is my favorite intro ... nice and simple:
18:55		audrey: http://www.loc.gov/marc/umb/um01to06.html
18:55	thd	audrey: This is an excellent brief guide: http://www.loc.gov/marc/umb/
18:55	audrey	I need explanation of reason for subfields in 852 field
18:56		they still confuse me
18:56		more detailed than introductory
18:56	kados	audrey: I can explain them to you
18:56	audrey	thanks for the recommendations:)
18:56		great
18:56	kados	let me pull up a record
18:57	audrey	the classification number is created from 4/more bits of information
18:57		and goes into more than 4 MARC 852 subfields
18:57	thd	audrey: The examples in the LC documentation are the best clue.
18:57	audrey	I've looked at them, and am still confused
18:57	kados	audrey: you're confused about classification?
18:58	audrey	no
18:58	kados	audrey: what then?
18:58	audrey	i want more information about why the classification bits go into each subfield
18:58		the dewey number is easy
18:59		its subfield says its just for dewey
18:59	kados	are you looking at Koha's MARC or MARC in general
18:59		cause there is a difference
18:59	audrey	MARC in general, right now
19:00		how much difference is there?
19:00	thd	audrey: http://www.itsmarc.com/crs/Bib1424.htm is actually the complete LC documentation for 852 with extra HTML.
19:01	kados	audrey: thd beat me too it with the link
19:01		audrey: the thing to keep in mind with LOC classification
19:02		audrey: is that it was designed by the library of congress :-)
19:02	audrey	looking at it now:) will see if it clears the fog:)
19:02	kados	audrey: have you ever been there?
19:02	audrey	no
19:02		in DC, but not at LC
19:02	kados	audrey: well i have ... it's HUGE!
19:02		audrey: they have a serious amount of space
19:02	thd	audrey: Koha does not use 852 internally. I have a plan but that would require quite a bit of work.
19:03	kados	thd: (even with zebra?)
19:03	audrey	so the suffix subfield is really because they have departments inside of buildings
19:03	kados	audrey: exactly
19:03		audrey: it's really an internal taxonomy for LOC
19:03		audrey: based on their specific needs
19:04	thd	audrey: prefix use is common. Suffix use is rare.
19:04	kados	audrey: academic libraries use it too ... because their collections are also often quite large
19:04		audrey: and spread across multiple buildings and floors
19:04		audrey: etc etc.
19:05	audrey	am looking at the reasons for prefixes, dewey, cutter--why split between different subfields?
19:05		why not in only one subfield?
19:05	thd	kados: using 852 would require nontrivial changes to how Koha manages items.
19:05	audrey	koha already uses 852. i saw it yesterday
19:05	kados	thd: lets talk about that ... how nontrivial?
19:06		audrey: naw ... not for managing the items :(
19:06	thd	audrey: Actually, Koha does it that way.
19:06	audrey	but is mapped and searchable, right?
19:06	kados	audrey: though we can nab stuff from the 852 and put it in local use fields in the 900s
19:07	audrey	hmm
19:07	kados	where Koha stores item info currently
19:07	thd	audrey: you can place the whole MARC record inside Koha but Koha uses a local use field for managing items.
19:07	kados	lots of systems do actually
19:07	audrey	what is "manage"?
19:07	kados	including Dynix
19:08		audrey: well ... there are a few bits of information that Koha pays attention to with items
19:08		audrey: barcode, item type are two
19:08	audrey	right
19:08	kados	audrey: managing basically means changing the status of an item
19:08		checked out, on reserve, lost, etc.
19:09	audrey	not looking for that with classification #'s.
19:09	kados	nope
19:09	thd	audrey: This might help confuse you more but explains the origin of Koha item management. http://www.kohadocs.org/holdings.html
19:09	audrey	item management is not focus at moment
19:10		classification details are
19:10		not for status purposes
19:10	chris	ohh is that what its modelled on
19:10		cool
19:10		i thought it was just something olwen and I thought up
19:11	thd	chris: I have an unfinished revision for that document proposing changes.
19:11	audrey	why does marc divide the info on back of books (the sticker bits) into different subfields?
19:12		why not just one big, changeable field?
19:12	thd	audrey: MARC Koha works with the whole call number in one local use subfield.
19:13	chris	thats a good explanation thd
19:13	audrey	thanks, will be looking at all recent information
19:13		and clearing my fogged brain:)
19:14	thd	chris: That is also the most turgid writing that I have ever produced. At least it serves some useful purpose.
19:14	kados	thd: in your opinion, if we implemented your recommendations in the holdings.html doc, would folks like stephen stop complaining?
19:16	thd	kados: The holdings.html doc has a careless mistake in the mapping but I do have an almost finished revision that is much more comprehensive.
19:17		kados: To avoid the problems that Steven complains about a valid complete default MARC framework for MARC 21 would be needed.
19:18		kados: Even the UNIMARC default framework is full of errors.
19:19		kados: I could build one easily.. It would require a few days to carefully audit everything.
19:20		kados: Steven complains about the non-repeatable subfields being set by default to repeatable for 650 etc.
19:21	kados	thd: will that even be an issue now that we've got zebra?
19:21		thd: I don't think it will
19:21		thd: at least not for searching
19:22	thd	kados: It is still an issue for a record editor that uses the frameworks and also for the several months before 3.X is stable enough for production use.a
19:22	kados	right
19:23		hmmm
19:24		thd: don't we still have problems with subfield ordering?
19:24	thd	kados: The absence of 000-008 in the frameworks makes Koha look very poor to anyone who investigates and causes data loss when editing records.
19:24	kados	thd: ie not preserving it?
19:25		thd: but we _do_ have that now!
19:25	thd	kados: exactly
19:25	kados	thd: so it's more complicated than just fixing the default framework
19:25	thd	kados: Only if the user audits the framework and validates everything to add what is missing.
19:26	kados	ok ... so we need to make sure that 2.2.6 has a better default framework
19:26		that fixes those issues
19:26		the subfield ordering thing ... that's tricky
19:27	thd	kados: Ordering has been mostly fixed. There are still correctable issues relating to presentation.
19:27	kados	ahh ... has it
19:27		wonderful
19:27		thd: feel free to create a new valid framework on LibLime's demo
19:28		:-)
19:28	thd	kados: I would be eager to fix the default framework for MARC 21. Would it help me pay my bills?
19:28	kados	thd: sure :-)
19:29		thd: feel free to bill LibLime :-)
19:39		walk even
20:05		thd: in your new version of the holdings doc, it would be interesting to say what other systems base their holdings on recomendation 995
20:05		thd: some 'name dropping' would be nice there :-)
20:07	thd	kados: French systems only. It would not be very meaningful to your customers.
20:07	kados	thd: I think Dynix does
20:07		thd: I could be wrong though
20:08	thd	kados: Unless there is a French branch or subsidiary of a non-French company.
20:09		kados: The largest vendors support UNIMARC in some manner so they might have French localised versions.
20:10		kados: I will see if paul has an idea tomorrow.
20:13		kados: What might also be good is to know what software does not use MARC 21 holdings format or uses nonstandard holdings data in current versions.
20:14	kados	yep
20:15	thd	kados: Of course Koha should head to support MARC standard holdings. Even recommendation 995 support is an adaptation rather than a fully compliant implementation.
20:16	kados	thd: well ... OCLC doesn't :-)
20:16		thd: I'm not 100% convinced we need standard MARC holdings
20:16		I can see some advantanges
20:16		well ... before I continue down that thread
20:17		what would be involved in getting MARC compliant holdings into Koha?
20:17	thd	kados: OCLC should now be converting to the MARC holdings format according to their intention announced at the beginning of last year.
20:17	kados	it can't really be that complicated
20:17		ahh ... they are converting _to_ MARC ...
20:17		I thought they were converting away from it
20:18		yea, so ... how hard could it be really?
20:18		and what advantages would it afford us?
20:19	thd	kados: They announced at a conference at the beginning of last year that they would be using the separate MARC 21 holdings format records.
20:21		kados: I had started a message to you about supporting MARC 21 and now also UNIMARC standard holdings when you were driving off to meet with a law library client..
20:21	kados	thd: still have it?
20:22		be right back
20:22	thd	kados: Most of the text was still in my head. I could reconstitute it.
20:53	kados	we don't want Koha's holdings to be _tied_ to MARC standard holdings
20:53		because there are instances when libraries or other orgs will not want to use MARC bib records
20:55	thd	kados: How many libraries are there in the world that do not want MARC?
20:55		kados: I do not mean the ones that want MARC to be hidden.
20:56		kados: Hiding the complexity of MARC from the user can readily be done.
20:57	kados	marc is evil
20:57	thd	kados: Even paul's record editor does much for that.
20:57	kados	there's no backing down on that point :-)
20:57		I want to support it
20:57		in Koha
20:57		but I don't want to be tied to it
20:58	thd	kados: MARC is designed for the ease of record exchange and detailed careful data recording.
20:58	kados	thd: by a bunch of loony academics
20:59	thd	kados: Koha should support many different record types.
20:59	kados	thd: exactly
20:59	chris	i think you need to change that too
20:59		MARC was designed for the ease
21:00	thd	kados: However, MARC has the most carefully thought out solutions to the most complex problems of information records.
21:00	chris	now people want to use it catalog in
21:00	kados	thd: that's only partly true
21:00		thd: it's not a very wel-though out storage format
21:00	chris	ie its purpose has been overloaded and hence its no longer as easy
21:01	thd	kados: It was designed when every byte was precious.
21:01	chris	my main beef with MARC
21:02	thd	kados: MARC in XML is very extensible.
21:02	chris	is that everyone forgets the M is Machine
21:02		humans arent supposed to see it
21:02		ever
21:02		ever
21:02		:)
21:02	kados	well ... but also
21:02	chris	but because they do
21:02	kados	machines often don't know what to do with parts of it
21:02	chris	they broke it
21:02	thd	chris: That is exactly my point about hiding it.
21:02	kados	chris++
21:02	chris	im sure it was a much better thing
21:03		when it was a standard
21:03		not 12 different standards that arent standard
21:03	kados	true
21:03	thd	chris: machines know what to do if th programmers tell them how.
21:03	chris	yep
21:04		its still broken
21:04	kados	so ... more thinking out loud
21:04		we need a way to abstract our holdings format
21:04	chris	im not saying it was always broken, just it is now :)
21:04	thd	chris: there are over 50 variants but there has been significant unification of the variant formats so that is much less of a problem now.
21:04	kados	in the same way we're abstracting our searching format with Zebra
21:05	chris	yeah
21:05	kados	ie, I can put in dublin core records into Koha RIGHT NOW
21:05		and Koha will find them
21:05	chris	yep
21:05	kados	so we've made a huge leap forward there
21:06		we need to also abstract our record editor's underlying format
21:06	thd	kados: Most of you customers and the potential ones with the most money to spend will want MARC holdings.
21:06	kados	thd: so we'll give them those
21:06		thd: but I don't want to make the mistake we made last time with MARC support
21:07		thd: which is to hardcode it into the scripts, templates and database
21:07		we need to compile some general purpose assumptions about holdings
21:07		then have a way to configure the holdings mappings to a specific holdings format
21:08	thd	kados: Serials holdings can be very complex.
21:09		kados: Certainly, a fresh approach to any problem with the advantage of knowing the work that has gone before can produce a better solution.
21:11		kados: Yet, without spending a significant amount of time how will you find a general problem that includes all the issues for MARC and everything else.
21:12	kados	well, as I see it
21:12	thd	kados: Karen Coyle wants to kill MARC but many of her potential allies are intimidated by the LC superpower with the veto.
21:13	kados	holdings are used mainly for one thing
21:13		keeping track of the movement of items
21:13		right?
21:13	chris	thats my understanding
21:14	thd	kados: Karen speaks about the possibility of a system designed for supporting FRBR efficiently but she has no design herself.
21:14	kados	thd: we're not talkinng about FRBR yet :-)
21:15		i didn't think FRBR addressed managing the movement of items
21:15		I thought it was mainly for searching
21:15	thd	kados: That seems close.
21:15	kados	and FRBR's what, 11 years old?
21:15		with not a single app using it?
21:15	chris	thats only about 2 weeks old in library time tho :-)
21:16	kados	true
21:16	thd	kados: Implementing FRBR is very inefficient. Variant data problems in legacy records and too many CPU cycles for processing.
21:16	kados	anyway, I digress
21:16		the point is
21:16		we need to improve Koha's internal methods for managing items
21:16		that much is clear
21:17		for instance, we should be able to place reserves on specific items
21:17	chris	yep
21:17	kados	we should also allow itemtypes at the item level
21:17	chris	this is what i was saying about biblioitems .. the point of them was missed
21:17	kados	so what we need, is a list of things that the MARC standard for holdings can _DO_
21:18	thd	kados: I have a solution to some of that for 2.X.
21:18	chris	ohh good point kados
21:18	kados	then, it's a simple matter of doing it better inside Koha
21:18	chris	yep
21:18	kados	then mapping it to MARC for those that want it
21:20	chris	you using wifi at a bar?
21:20	kados	hehe yea, casa actually :-)
21:20	chris	cool :)
21:20	kados	where I do my best thinking :-)
21:20	thd	kados: I see more clearly, searching complex MARC serials holdings is very inefficient.
21:23	kados	thd: exactly
21:23		thd: are you up to writing some on what MARC holdings (serial and otherwise) allow a library to do?
21:24	thd	kados: You need a means of addressing the same complexity as MARC can.
21:24	kados	thd: can you explain what you mean by 'complexity'?
21:25	thd	kados: complex publication patterns, special issues, supplements, missing issues, title changes of the same publication.
21:27		kados: The problems are worst when the information has been recorded as a machine readability aid.
21:28	kados	thd: Kokha can do much of that already
21:28	thd	kados: There is are NISO and ISO standards for serials.
21:28	kados	interesting
21:29	thd	kados: yet what Koha can do already is as difficult to use as MARC and does not interface well with items although I know hdl partly fixed that.
21:31		kados: there are also part, whole, etc. record relationships for linking between sets of holdings data and bibliographic records.
21:32	kados	I need to digest that sentence
21:33		I _have_ afterall been drinking :-)
21:33	thd	kados: Unfortunately, the international standards are insufficiently detailed. They exist as a mere starting point constructed after the fact.
21:35		kados: The hierarchal relations are the most troublesome for encoding, decoding, and searching.
21:36	kados	thd: what kinds of searching would we need to do?
21:37		thd: what do the hierarchal relationships allow the system to do?
21:39	thd	kados: The system has to manage the relationships between serial supplement 5B and a monograph after a title change. Display whole runs with links to the records and missing issues. Etc.
21:39	kados	the title change should be easy
21:39		with authorities
21:40		links are cake too
21:40		already does missing issues
21:40	thd	kados: manage overlapping, embargoed, and gaps in electronic database coverage.
21:41	kados	coudl you expand on that last bit?
21:41		what's supplement 5B?
21:42	thd	kados: holdings can store information about indexing and abstracting database coverage as well as full text databases.
21:42	kados	k ... holdings can store info about where full text databases are
21:42		and it can store information 'about indexing and abstracting database coverage'?
21:43		I don't quite understand what you mean by that
21:43		does it say _how_ to index?
21:43	thd	kados: vendors will have gaps for single issues, spans of time, exclusion and coverage that changes on a sliding basis over time.
21:44		kados: Holdings can link to electronic databases, or hard copy, electronic databases can link back.
21:44	kados	thd: linking is easy
21:44		thd: we just need openurl
21:44		thd: about the gaps, spans of time, etc.
21:45		thd: have you seen any really good examples of serials managment functionality in a proprietary system?
21:45		thd: ie, do you understand how they work?
21:45	thd	kados: OpenURL yes but OpenURL needs detailed holdings for the resolver to use.
21:46	kados	thd: isn't that what we're talking about providing?
21:47	thd	kados: I have seen examples where the catalogues point to multiple hard copy and not yet come back next month when the embargo period is over electronic databases.
21:47		kados: I understand how OpenURL works.
21:48		kados: Karen Coyle did a fine job rewriting the documentation.
21:49		kados: There are published papers on link resolvers.
21:50		kados: The big secrets are keeping the information about electronic resources current when your vendor is constantly changing the coverage without overwhel;ming the system maintenance staff.
21:51	kados	right
21:53	thd	kads: 5B was an arbitrary example for something like a serial back cover pocket insert that goes inside a volume of a law book.
21:54		a supplement with the latest text revisions and case law updates.
21:55		kados: Supplements are theft targets and vital to track for law libraries.
21:56		kados: Not that they are replaced when stolen at the poor libraries.
21:56	kados	thd: before I forget: http://opactest.liblime.com/cg[…]ha/search-test.pl
21:56		thd: that's the CQL test that chris built
21:56		thd: probably not 100% CQL yet but it does a pretty good job
21:56		for two days of programming :-)
21:57		thd: Koha supports summpements right now?
21:57		thd: I've seen them in serials
21:59	thd	kados: The issue is not for some support for supplements but supporting the complex part/whole relationships for supplements, special issues, etc.
21:59	kados	I don't really understand what that entails
21:59		could you explain it?
22:02	thd	kados: Koha support for serials is based on regular publication patterns that are predetermined. It is not very flexible for accommodating changes in publication patterns in the midst of a period or changing and irregular patterns.
22:03	kados	that's true
22:03		so let's focus on one thing at a time
22:03	thd	kados: also, there is poor linking between bibliographic records where one record might cover the title as a whole, another an issue, another an article.
22:04	kados	???
22:04		one serial record you mean?
22:04		I dont' quite follow
22:05	thd	kados: holdings records can also be some arbitrary level below the title level.
22:07	kados	that's a little too abstract for me
22:07		I need some examples
22:08	thd	kados: You have the annotated edition of the state laws with many volumes and a monograph record for the set.
22:10		kados: The set itself has holdings record for the set as whole.
22:11		kados: The criminal code part has a record for its several volumes both bibliographic and holdings.
22:11		kados:
22:12		kados: The criminal code may have its own index record linked to the other volumes.
22:12	kados	and there are links between the different records? is that what you're getting at?
22:12	thd	kados: And each one can have separate holdings.
22:13	kados	meaning what exactly ... you can interact with the records at each level in the tree?
22:14		what kinds of things can you do with the separate holdings records?
22:14	thd	kados: furthermore, the books may be monograph records and the back pocket supplements may be serial records.
22:15		kados: separate records allow for easier management, however, there could be one hyper-complex record managing it all.
22:16		kados: Either way the system has to bring all the relations together.
22:17	kados	that's not too hard to do
22:17		we could tackle an indefinite heirarchy using either an adjacency list or a nested set
22:17	thd	kados: Separate bibliographic records for various parts allow the inclusion of details about parts that would not have a proper place in a general record.
22:18	kados	you could assign ids for each node in the tree
22:18		nodes could be ordered by date
22:18		peer nodes that is
22:18		if we used nested sets
22:18	thd	kados: the difficulty is bringing a complex search path together when the records were not created in a neat top down manner.
22:18	kados	actually, with nested sets the search path is quite simple
22:19		you can traverse the heirarchy fairly easily
22:19	thd	kados: Also, searching hierarchies can be inefficient.
22:19	kados	thd: nested sets are ver efficient
22:19		very even
22:20		I've been doing some work with them for PINES
22:20		what we need are some specific case examples
22:20		very specific examples
22:20		like actual serial records
22:20		for real serials
22:20	thd	kados: I mean that only relatively as compared to flat data. Extra CPU cycles add up in large collections.
22:23		kados: Obtaining bibliographic records is easy, finding complex holdings records may be more difficult..
22:23	kados	a well designed heirarchy can be very fast
22:23	thd	kados: Koha has historically used the simplest set of records as examples.
22:24		kados: You need to have access to the most troublesome records for testing that challenge all aspects of a system.
22:26		kados: Fast is merely relative but part of my point is that the records may have tortured relations with one way mappings. Koha could find the implied relationships for easier access.
22:28		kados: A large problem will be that accessible records will make extensive use of human readable relation encoding that some system needs to parse in order to work well.
22:30		kados: Koha uses machine readable encodings for serials now, most serials holdings data was actually meant for human readability because the machine parsable holdings fields had not been designed.
22:31		kados: The most high demand holdings would have been redone as new records were created and systems supported their creation.
22:33		kados: There is a wide variation in actual practise. If you can parse textual holdings fields even partly with regular expressions you will have a migration advantage.
22:36		kados: After all the nonsense to obtain a client, I assume that data migration is the next most time consuming part of your services. Am I correct?
22:37	kados	of course :-)
22:38	thd	kados: The better your support for mapping MARC holdings onto whatever the lower your costs will be.
22:39		kados: Of course, you need a two way map in order not to be an evil trap for your customer like so many proprietary systems are.
22:42		kados: Half the problem with Koha serials holdings now is the user interface requires careful application of machine encoded values, and is fairly brittle.
22:42	kados	true
22:43	thd	kados: Record creation and modification is of comparable complexity to migration.
22:45		kados: Currently the migration path would be tedious except that hdl designed holdings for libraries that usually had simple reliable publication patterns.
22:46		kados: Who is the intended audience for Koha 'fact sheets'.
22:47		?
22:50		kados: I think it will be easier for me to change a default MARC 21 framework on my own system and send you an SQL file. I will have better access to the data than if I work on yours.
22:53		kados: The ISBD mapping took about ten hours and I never audited it when I was done but I had to follow some ambiguous ISBD rules. At least there was no consequence for data loss in case of error there.
22:56	kados	ok
22:56		that's fine
22:57	thd	kados: "For whom are 'fact sheets' intended?
22:58	kados	potential clients of course
23:00	thd	kados: The bean counters who do not know MARC from Dublin Core and make all the decisions or the cataloguers who dream MARC and have no influence?
10:12	kados	paul: http://opactest.liblime.com/cg[…]a/search-test.pl?
10:12	paul	paul greets kados
10:12	kados	import of one of my client's data seems to have finished
10:13		and chris's CQL query is working nicely
10:13		hi paul :-)
10:13	paul	christian library it seems.
10:13		The Book of Ezekiel /
10:13	kados	yep :-)
10:13	paul	The Book of Daniel /
10:13		The Book of Isaiah /
10:14		(don't ask me what I entered as search term ;-) )
10:14	kados	try 'book not daniel'
10:14	paul	works nicely
10:15	kados	I'm still not sure all the records made it in
10:15		as the last record that loaded threw an error
10:15		about invalid ZXML
10:15		XML that is
10:15		I was seeing similar errors after importing about 200 items
10:16		the import would crash
10:16		but I think that was because we wern't destroying the zebra connection
10:16		and zebra was getting overloaded with too many connections
10:17		import is quite slow the connect - import one record - disconnect way
10:17		I wonder if we can have just one connection for the entire process
10:17		might be a speed improvement?
10:18		i also noticed that it gets slower as more items are added
10:18		so at the beginning, it was about 10-20 per second ... but after 30,000 it was about 1 per second
10:18		just some things I've noticed :-)
10:30		paul: in your opinion, should we use the create()/connect() method to maintain the same object while updating records?
10:30		http://search.cpan.org/~mirk/N[…]b/ZOOM.pod#___top
10:31	paul	??? not sure to understand what you mean.
10:31	kados	right now
10:31		we do:
10:31		$Zconn = new ZOOM::Connection(C4::Context->config("zebradb"));
10:31		update a record
10:32		$Zconn->destroy();
10:32		I was hoping it would be faster to do:
10:32		$Zconn = new ZOOM::Connection(C4::Context->config("zebradb"));
10:32		update many records (all in fact)
10:32		$Zconn->destroy();
10:32		does that make sense?
10:35	paul	maybe you're right, but with mod_perl that will be automatic behaviour I think (without destroy), while without mod_perl, we would have a destroy when the process ends
10:36		(writing a mail on koha-devel to give -quite good- news from Ouest-provence)
10:37	kados	woo hoo!
10:47		paul: did you write the zebra_create method in Biblio.pm? or did chris?
10:47	paul	I did
10:47		(not sure the commited one is the last I have)
10:48	kados	ok
10:50		I'm thinking we should change the name
10:51		there are several methods in the ZOOM::Package module
10:51		one of them allows 'createdb'
10:51		if we had a generic ZOOM::Package wrapper we could pass in options that we want
10:54		it looks like we could pass in a hash object for $options
10:54		and then we need an 'operation' for the $p->send method
10:55		I may work on this routine a bit
10:56	paul	(createdb is already in some code I wrote. note also that there is a dropdb... that is buggy and do nothing)
10:56		mail sent to koha-devel, should arrive in 4 hours ;-)
10:57	kados	heh
10:57		hmmm, dropdb is buggy eh?
10:57		did you notify Mike Taylor?
10:57	paul	yep. does nothing.
10:57		it's mike that warned me ;-)
10:57	kados	ahh :-)
10:58		do you like my idea for having one ZOOM::Package method in Koha
10:58		and passing it options, operation, etc.?
10:59	paul	something like the ->dbh in Context iiuc
10:59		yes.
10:59	kados	yes I am speaking of two things

← Previous day | Today | Next day → | Search | Index