← Previous day | Today | Next day → | Search | Index
All times shown according to UTC.
Time | Nick | Message |
---|---|---|
17:10 | kados | thd: I'm here |
17:27 | thd | kados: I am back |
17:29 | kados: "until recently there were few stated instructions concerning this order" | |
17:30 | "Modern classification schemes contain instructions, called citation formulae." | |
17:32 | "But in most cases, subject catalogers had to rely on arrangements already established in the list as a guide." | |
17:32 | that would be the list of subject headings | |
17:34 | "New headings were generally established according to existing patterns." | |
17:34 | "Even so what was done over the years was not always consistent." | |
17:35 | "Thus there is considerable variation on this score in LCSH as it stands now." | |
17:36 | kados | wonderful :-) |
17:39 | thd | kados: this is from Chan 1995 |
17:39 | kados it does get better | |
17:40 | kados: that was from the beginning of a chapter section on subdivision ordering | |
17:41 | kados: there was a conference in 1991 which considered rectifying this problem for better machine readability | |
17:42 | kados: this edition gives the two basic current patterns. | |
17:42 | kados: I gave you place--topic--time--form yesterday | |
17:44 | kados: the rule for that is when the subject string begins with a geographic heading, a 651 | |
17:44 | kados: rule 2 is for topical headings, 650 | |
17:45 | kados: then use topic--place--time--form | |
17:46 | kados: then we have the rule from what john quoted yesterday | |
17:48 | kados: when the string contains another topical element, $x, the elements may be arranged in one of the following orders | |
17:49 | kados: we do have guidance for choosing which but only if we have the authority file showing 008/06 or the printed guide of LCSH. | |
17:50 | kados: so with $x we have either topic--topic--place--time--form | |
17:50 | kados: or topic--place--topic--time--form | |
17:51 | kados: the heading authority 008/06 tells us which to use. | |
17:52 | kados | interesting |
17:52 | that's good news | |
17:53 | good detective work thd! | |
17:53 | I posted the question to autocat as well, dont' know if you saw that | |
17:53 | thd | kados; that last bit was what I was trying to explain to you last night |
17:53 | kados | I haven't eaten yet today |
17:53 | so I'm gonna go do that | |
17:53 | be back soon | |
17:54 | thd | kados: I did not see it appear but the list is moderated so I would not expect the question to appear before Monday |
18:08 | kados: when you are back there is something informative from page 400 | |
18:17 | kados | i added some cool options to the default facet: |
18:17 | http://zoomopac.liblime.com/cg[…]koha/opac-main.pl | |
18:17 | thd | kados: page 400 is really good |
18:18 | kados: there is a discussion about whether the order really matters in the era of key work matching | |
18:18 | kados | hmmm, they're not working :( |
18:19 | thd | \kados: who they? |
18:19 | kados | thd: something I was working on |
18:19 | thd: continue :-) | |
18:19 | thd | kados: but the order has semantic value |
18:21 | kados: sometimes it makes a real difference to the meaning about how things are ordered and you can see the way subtle word changes in MARC 21 subfield names means something different | |
18:21 | than previously | |
18:22 | kados: there is an example given here | |
18:24 | kados: "For example. the heading Law--Periodicals--History is used for a history of law journals, and Law--History--Periodicals is used for journals about legal history." | |
18:24 | chris | yep quite different |
18:25 | thd | kados: note how the pattern in the first instance does not follow one of the common rule patterns given |
18:26 | the first pattern is topic-form-topic where we previously had patterns given with form at the end | |
18:26 | hello chris | |
18:27 | chris | hi thd |
18:27 | thd | chris, kados: now for the best sentence in the book |
18:28 | chris | although in the case above it is topic--topic--form .. as periodicals is the topic it just looks like topic--form--topic |
18:29 | which is what makes rules so tricky :) | |
18:30 | thd | chris, kados: "The instructions given in "Subject Cataloging Manual" have become so detailed and voluminous that they have become difficult for many outside the Library of Congress to apply." |
18:30 | chris | :-) |
18:30 | let alone the public to comprehend :) | |
18:31 | thd | chris: Periodicals as used here might always be considered a form even if it is used as a topic as you say. |
18:31 | chris | yeah, language isnt precise enough :-) |
18:31 | kados | hehe |
18:32 | pretty pointless to maintain a taxonomy that's only known by one group of people for which there exist no tools capable of making use of it | |
18:34 | thd | chris: yesterday I saw an account of the problem where a user without specialised library training could never guess the pattern and then of course he could also never guess the authorised form but the correct terms can be found with indexing tracings and references in authority files. |
18:36 | kados: so you build the tools to allow everyone to use it | |
18:36 | chris | yep |
18:36 | kados | thd: that's the idea :-) |
18:36 | but shame on LOC for choosing voyager | |
18:36 | chris | yeah who names an ils after a space shuttle anyway |
18:36 | thd | kados: it is not difficult if you presume that the existing records are correct and do not worry much about how to determine the correct order |
18:37 | kados | thd: I thought we were planning to try to determine the correct order |
18:38 | chris | what we could do |
18:38 | is use some flash | |
18:38 | and just have them rearranging themselves constantly | |
18:38 | cover all our bases :-) | |
18:38 | kados | heh |
18:38 | thd | chris: was that not also the name of the probe with the indecipherable golden disk full of pictures and primes |
18:39 | chris | hmm i think you are right thd |
18:39 | thd | oh there is more |
18:40 | just another reference to the 91 conference to fix all this and I guess when this book was published in 95 everyone was awaiting approval of their recommendations | |
18:40 | chris | ok, lunch and a meeting for me, see you guys later, good luck with the subject ordering |
18:41 | thd | kados: I was right as I said about the second $z being subsidiary to the first. That is a common rule. |
18:43 | kados: pretty good for ten minutes access to a library where I have no borrowing privileges | |
18:44 | kados: it did help that I knew exactly what book to find and ran to the copy machine before they closed | |
18:46 | kados | :-) |
18:47 | thd | kados: so it would be good to know if the recommendations of the Subject Subdivisions Conference from 91 were ever formally adopted |
18:48 | kados: maybe someone on autocat will say or maybe Google has indexed the answer | |
18:50 | kados | hopefully |
18:52 | thd | kados: http://www.loc.gov/catdir/cpso/subdconf.html |
18:58 | kados | I can't believe they had a converence on subject subdivisions :-) |
19:07 | thd | kados: I had been reading about the OCLC work on authority file strings. It seemed very impressive. It seemed to have started with an effort to fix spelling mistakes in subject headings in their union catalogue. |
19:08 | kados: wouldn't you want to attend that conference? :) | |
19:08 | kados | hehe |
19:14 | and with that, I'm heading back home | |
19:14 | be back on soon | |
20:04 | thd | kados: I had been reading recently about much of the work which had received an impetus from the conference but I had not seen that conference as a starting point. |
20:06 | kados: the report at http://www.loc.gov/catdir/cpso/subdconf.html basically says that the problem is too big to fix but we will fix what we can afford to fix. | |
20:07 | kados | :-) |
20:08 | thd | kados: OCLC is responsible for trying to upgrade many of the library records |
20:09 | s/library/libraries/ | |
20:10 | kados: the big problem with upgrading records from what I have read is what happens when one authorised heading is cancelled and replaced by two narrower ones. | |
20:12 | kados: then there is no clean and obvious path so there is a computer recommendation made from key words found in the record and a human then confirms or corrects the recommended choice. | |
02:00 | toins | hi all |
02:17 | salut bruno | |
02:19 | btoumi | salut toins |
02:19 | hi all | |
02:22 | comment avs tu toins? | |
02:22 | toins | ca va |
02:23 | et toi N | |
02:23 | ? | |
02:24 | btoumi | ca va bien malgres les nuit a 30°c :=( |
02:24 | toins | ah oui... je connais ça aussi |
02:41 | btoumi | trop dur pour dormir |
03:01 | hi hdl | |
03:01 | hdl | hi. |
03:01 | back from holidays | |
03:01 | btoumi | ah ok good holidays? |
03:02 | toins | hello hdl !!! |
03:02 | hdl | helo |
03:02 | Is paul around ? | |
03:03 | toins | I think he is at his office |
03:03 | btoumi | chris: areu around? |
03:03 | toins | but still without DSL ... |
04:59 | btoumi | toins are u around? |
05:03 | tumer | btoumi:did you sort your calendar scripts? you did not ask anything so i am assuming you did |
05:05 | btoumi | tumer: i work on fines and i recreate the fines2.pl to use day non worked and special days and i create specials fonction to calculate that |
05:05 | taht why i don't anithing to u | |
05:06 | tumer | calendar.pm already has those functions in it |
05:06 | btoumi | yes but i must work with table from holydays module |
05:07 | because day closed depends the library | |
05:08 | tumer | when we calculate a fine we do not use this module. The reason is we add days to date_due when issuing |
05:08 | so if the library is closed the date_due is given to next day automatically | |
05:09 | the fines module comes into action after that | |
05:10 | btoumi:is that how you are doing as well | |
05:12 | btoumi | i create 2 function getspecials holidays and getrepeatableholydays |
05:13 | the first function return how many days there is between date due and date of the day | |
05:14 | the second calculate how many day are like the repeatable day in the table | |
05:14 | after i do the sum | |
05:15 | and we don't calulate fines before the "after day" | |
05:15 | from issuingrules | |
05:17 | tumer | ok thanks |
05:18 | kados: u around? if not ping me when u are I have a big problem with the new API | |
06:58 | kados | tumer[A]: I'm here ... |
06:58 | tumer[A]: which new API? I just invented a new one last night :-) | |
06:59 | tumer | hi kados |
07:00 | well splitiing items from biblios has created a problem | |
07:00 | i can not search a title say in a specific branch | |
07:01 | kados | hmmm |
07:01 | tumer | i have tried all tricks i know but zebra does not do linked seraches |
07:01 | kados | you will need to use yaz proxy I think |
07:02 | have you asked ID about this? | |
07:02 | tumer | they seem to be on holiday |
07:02 | kados | hehe |
07:03 | tumer | well i can do async mode search and search both items database and biblios database simultanously |
07:03 | kados | yea? |
07:03 | tumer | just a sec |
07:04 | kados | what I think we need is 'merged result sets' functionality |
07:10 | tumer | yes, but for that we have to be able to link databases together say with biblionumber |
07:11 | until somebody comes up with some idea I am stuck | |
07:13 | you have not submitted anything from the zip file so i am keeping the new biblio and search until this case is solved | |
07:13 | kados | ok ... |
07:13 | tumer | and by the way the new API is it this facets? I could not get it to work |
07:13 | kados | I would ping ID again ... they are supposed to respond quick quickly |
07:14 | tumer | zoomopac at liblime gives me only facets and no results |
07:14 | kados | ? |
07:15 | what browser are you using? | |
07:15 | (haven't tested in IE, but it looks fine in FF and Safari) | |
07:16 | what I've been doing is actually quite simple | |
07:16 | I just discovered it last night and it makes some hard things very easy | |
07:17 | my @ccl_query = $query->param('ccl_query'); | |
07:17 | foreach my $ccl (@ccl_query) { | |
07:17 | $ccl_query .= "$ccl "; | |
07:17 | } | |
07:17 | this means that in the template I can have something like: | |
07:18 | <select name="ccl_query"> | |
07:18 | <option value="keyword"> | |
07:18 | </select> | |
07:18 | <input tyle="text" name="ccl_query"/> | |
07:18 | and a submit | |
08:04 | btoumi | hi paul |
08:05 | paul | hello btoumi |
08:08 | kados | hi paul |
08:08 | paul | hello kados |
08:08 | my opac-zoomsearch works really fine. | |
08:08 | kados | cool |
08:08 | I'm going to be making quite a few changes to make the code simpler | |
08:08 | paul | i've done some hacks for UNIMARC, but i'll have to synch with your recent commits... |
08:08 | i have a last question | |
08:09 | my datas are in UTF-8 | |
08:09 | kados | I will probably abandon use of PQF |
08:09 | paul | in zebra as well as in SQL |
08:09 | kados | and instead use CCL |
08:09 | right | |
08:09 | paul | when I search in opac-zoom search, accented chars are shown as latin1 chars (a , instead of a é for example) |
08:09 | I have investigated a lot, and found the culprit : | |
08:10 | kados | (I suspect the filehandle must be opened with utf8 flag set |
08:10 | paul | in USMARC.pm, line 170 : |
08:10 | if ( $marc->encoding() eq 'UTF-8' ) { | |
08:10 | warn "ENCODING $tagdata"; | |
08:10 | # $tagdata = marc_to_utf8( $tagdata ); | |
08:10 | } | |
08:11 | if I comment $tagdata = ..., it works well | |
08:11 | kados | hmmm |
08:11 | is that the same line that Tumer commented out? | |
08:13 | paul | hi tumer |
08:13 | tumer | hi paul. |
08:13 | paul | can you look at immediate logs ? |
08:13 | tumer | the problem with usmarc.pm is a bug |
08:13 | kados | paul: can you add a note (tumer too) about what you discover about encoding to the encodingscratchpad on the wiki? |
08:14 | paul | ok kados. |
08:14 | tumer | i have commited a new usmarc.pm corrected to koha cvs |
08:14 | kados | I have been keeping notes there |
08:15 | tumer | paul i have a problem with new db design. separating items resulted in not being able to search linked databases |
08:15 | paul | yep, i've seen your mail |
08:15 | tumer | i could not find a solution and i am stuck |
08:16 | paul | maybe there is no solution, except staying with biblio & item in the same zebra DB ? |
08:16 | tumer | yes but MARC definition does not allow that for multiple items |
08:17 | paul | ??? |
08:17 | tumer | we have to have separate holdings marc |
08:18 | can you follow me paul? | |
08:18 | paul | i'm afraid no |
08:20 | tumer | well according to LoC if there is more than one item you are supposed to create separaete marc records for it and not embed it in |
08:20 | so we have a items marc and biblios marc | |
08:20 | even if i put them in same zebra db i cannot link them in zebra | |
08:21 | i cannot search title and branch and get a merged result | |
08:21 | is it more clear? | |
08:21 | paul | yas, thx. previously, we embeeded the items in MARC record. why MUST we change this behavious ? |
08:22 | for me & my customers, it's useless today. | |
08:22 | in France/UNIMARC, we use recommandation 995. | |
08:22 | there is a standard for unimarc holdings as well, but "nobody" uses them, afaik | |
08:22 | tumer | kados wants more MARC compliant behaviour |
08:23 | i wanted to reduce indexing time whith circulation data | |
08:23 | but now in wain | |
08:24 | in MARC21 its more than 1 field 8.. for holdings | |
08:24 | kados | tumer: yaz-proxy may help you |
08:24 | tumer: and write ID again (be sure to cc me) supportindexdata.dk | |
08:25 | tumer: asking them if what you want to do is possible | |
08:25 | paul | tumer: where is your fixed usmarc.pm ? I can't find it |
08:25 | tumer | kados: yaz-proxy does not merge just balances the work load between databases ý think |
08:25 | kados: do you remember where i send usmarc.pm -to you was it | |
08:26 | kados | to the list |
08:26 | tumer | oh yes attachemt to koha-devel |
08:26 | kados | the problem I'm facing with MARC::* |
08:26 | paul | ok, gotcha |
08:26 | kados | is that I've spent so many long hours on it |
08:26 | and i can't get the maintainer's to agree there is a problem | |
08:27 | because there are no test cases that can be easily reproduced | |
08:27 | btoumi | paul: what is gotcha? |
08:27 | kados | that clearly show the problem |
08:27 | btoumi: it means 'I understand' | |
08:27 | paul | btoumi: the usmarc.pm link |
08:27 | tumer | even reading the script its obvious |
08:27 | paul | (and in this case : "ive got it") |
08:27 | btoumi | ok thanks all |
08:27 | kados | tumer: did you ask per4lib about whether it's a bug? |
08:28 | tumer | it says if the leader says UTF8 convert from marc8 to UTF8 which is stupid |
08:28 | kados | it's OOP so there are some other things happening in there |
08:28 | but I haven't had time to investigate too much | |
08:29 | so I suppose I should trust you :-) | |
08:29 | paul | it runs a "decode" (from Encode perl package) to decode to utf8 something that already is utf-8 |
08:29 | I agree with tumer it's probably a bug. | |
08:29 | kados | ok ... so we have two people to confirm this |
08:29 | tumer | also gets the characters decode(UTF8) and leaves it like that |
08:29 | kados | what we need is a small test case to prove that it's a problem |
08:29 | and someone needs to write to perl4lib | |
08:29 | to say 'this module has a major bug ... here it is ... here is the test that proves it ... here is the solution' | |
08:30 | so that we can continue to use the standard module rather than branch our own version | |
08:30 | paul | s/it/if/ |
08:30 | kados | thanks |
08:30 | tumer | i am very busy with the new system setup |
08:31 | i am experiencing lots of timeout from zebra while indexing | |
08:31 | 40 cataloguers working at the same time and zebra cannot keep up | |
08:32 | that was one of the raesons trying to seperate items. I did it and i ahve to go back | |
08:33 | kados | tumer: do you issue items yet? |
08:33 | tumer: before you waste all that effort, I'd ask ID if there is anything that can be done for merging result sets | |
08:33 | tumer: maybe even some sponsored development or something | |
08:34 | tumer | issue?? |
08:34 | dewey | issue is moot for geographic subjects 651 where is does not matter if $z comes between $z or not |
08:35 | kados | tumer: check-out / check-in |
08:35 | tumer | kados: currently circulation desk is closed |
08:35 | kados | ahh ... |
08:35 | for summer? | |
08:35 | or has it never been open? | |
08:36 | tumer | special closure for a month |
08:36 | kados | for what? holiday? |
08:36 | koha? :-) | |
08:36 | tumer | to catalogue 3000 books a day |
08:36 | kados | wow |
08:36 | tumer | before the academic year |
08:36 | kados | are you using shadow indexes? |
08:37 | tumer | i tried it all it even gets slower |
08:37 | paul | tumer & kados : could you answer my mail on koha-devel : "[Koha-devel] zebra config files : about default.idx" ? |
08:37 | kados | I had an idea about using shadows |
08:37 | only run a commit once every 10 minutes or so | |
08:37 | rather than after each change | |
08:37 | I was hoping it would improve the speed | |
08:37 | have you tried that? | |
08:38 | paul: looking now | |
08:38 | tumer | thats normal during this period but not when we are issuing |
08:38 | paul:i think the problem is more than that of tabs | |
08:39 | its too long what i think so i will write to you later | |
08:41 | kados | k |
09:16 | btoumi | question for all . the sub getiteminformation (from Cir2.pm) will not disapear in futur? |
09:17 | toins | btoumi, at least, it will be renamed to GetItemInformation |
09:18 | tumer[A] | i have kept it in modified API as well |
09:18 | btoumi | j'allais te le demander toins |
09:18 | toins | hehe |
09:18 | btoumi | paul yes it's a fonction wo give all information about item |
09:19 | paul | so it's proper place is Catalogue.pm, not in Circulation ! |
09:20 | btoumi | i don't think to look in catalogue.pm |
09:20 | paul | Catalogue.pm has to be used for every catalogue related operation, except search & cataloguing. |
09:21 | so it should be the best place I think | |
09:21 | btoumi | i think so but i never work on this part of koha ,i look for equivalent function |
09:22 | where i can find Catalogue.pm | |
09:22 | ? | |
09:22 | paul | don't mind with this for instance, ToinS will solve it when it's code cleaning reaches this module ! |
09:23 | btoumi | strange i don't find Catalogue.pm |
09:24 | bookseller.pm | |
09:24 | i suppose ;=) | |
09:24 | toins where is Catalogue.pm | |
09:25 | ? | |
09:27 | toins | btoumi, I don't know |
09:28 | btoumi | paul: u talk about Catalogue.pm can u tell me where is this module? |
09:29 | paul | maybe it does not exist yet, but he will be created later ;-) |
09:29 | or maybe it will be in Biblio.pm | |
09:29 | btoumi | ok thanks |
09:29 | ;=) | |
09:35 | paul : yes it's in biblio.pm | |
09:35 | its => getbibliofromitemnumber | |
09:36 | and i now toins that whe must rename this function GetBiblioFromItemnumber | |
09:36 | toins | ;-) |
09:36 | btoumi | toins: is it righ*? ;=) |
09:37 | toins | it's depend on what this function do exactly.... |
09:46 | btoumi | return all information form biblio,items biblioitems with an itemnumber |
09:48 | paul: is it a good choice to use getbibliofromitemnumber from biblio.pm? | |
09:50 | ty | |
09:50 | paul | hdl around ? |
09:50 | hdl | hi paul |
09:52 | btoumi | toins: are u around ? |
09:52 | toins | yep |
09:53 | btoumi | i don't change name of function because there is a lot of sub with bad name |
09:53 | toins | no problem |
09:53 | btoumi | and i know u do it later |
09:54 | toins | ok |
09:54 | btoumi | if there is only one i do :=) |
09:56 | paul | hey, shedges is with us for a few minuts ! |
09:56 | welcome here dear friend | |
09:58 | shedges | hi paul -- how are you? |
09:58 | paul | fine, thx. although a little bit hot in Marseille this month |
09:58 | & august won't be better probably :-( | |
09:58 | shedges | From watching the Tour de France, I think ~all~ of France is hot this month! |
10:12 | kados | hey shedges , how's it going? |
10:13 | shedges | hey kados! |
10:13 | I added image (binary) files for the Users Guide to CVS this weekend | |
10:14 | kados | cool |
10:14 | I'm totally swamped this week :( | |
10:15 | shedges | no versioning support, of course, but a place to store them for retrieval |
10:15 | good work? or busy work? | |
10:15 | kados | but I hope to have some time soon to work on linking kohadocs.org up to CVS via rsynch |
10:15 | mostly good work | |
10:15 | http://zoomopac.liblime.com is maturing | |
10:15 | shedges | good deal! |
10:15 | kados | I was hoping to get 2.3.1 rolled out yesterday but didn't quite get to it |
10:15 | we need a new design :-) | |
10:16 | I've been comparing out OPAC design to the linkes of Amazon.com, ebay, google, and some of the nice OPACs out there | |
10:16 | s/out/our/ | |
10:17 | evergreen's going quite well also: http://demo.gapines.org | |
10:17 | (design wise) | |
10:17 | shedges | Who's doing the design for you? |
10:18 | kados | of evergreen? one of the PINES developers |
10:18 | shedges | no, zoomopac |
10:18 | kados | I only criticize :-) |
10:18 | ahh ... that'd be me :-) | |
10:19 | owen and I are gonna talk about it this week though | |
10:19 | shedges | OK, now I gotta look... |
10:19 | kados | so hopefully he'll have some ideas |
10:19 | hehe | |
10:20 | paul | kados : the zoomopac is not ugly, it's for developpers ;-) |
10:21 | guys, I have to leave... | |
10:21 | shedges | bye paul |
10:21 | kados | ciao paul |
10:21 | paul | bye shedges, nice to read you |
10:21 | see you tomorrow kados | |
10:21 | thd | tumer[A]: are you still there? |
10:21 | toins | bye paul ! |
10:21 | thd | good evening paul |
10:31 | kados: I am going to change my vote and opt for answer 5 on the autocat list, the one that you did not offer as a choice. | |
10:37 | kados | thd: which one is that? :-) |
10:38 | thd: is it the 'don't attempt this at home' one? :-) | |
10:38 | thd | 5. |
10:38 | $a Architecture | |
10:38 | |->$x History | |
10:38 | |->$z Illinois | |
10:38 | | |-->$z Chicago | |
10:38 | |->$v Pictorial Works. | |
10:38 | kados | ahh |
10:39 | yea, that's a goodun | |
10:39 | so that implies it's a pictoral history of Chicago Illinios ... but where does architecture come in? | |
10:40 | shedges: are you on the autocat list? | |
10:41 | thd: I'm adding 5 to my liset | |
10:43 | thd | kados: my suggestion is that unless one the standard subdivision sequence is broken then keep the subdivision of the same kind (topic, $x, in this case) as $a with $a. |
10:44 | kados | http://kados.org/hierarchy.txt |
10:44 | thd | kados: that may change the order of some forms but not where it affects the meaning. |
10:46 | kados: the second line in your example 5 does not line up correctly | |
10:46 | like the ones above | |
10:46 | kados | oops |
10:47 | fixed | |
10:47 | shedges | kados: nope, not on the autocat list |
10:47 | thd | kados: and I indented Chicago too much |
10:48 | kados | shedges: here's a puzzle for you: http://kados.org/hierarchy.txt |
10:48 | thd | kados: take out a dash from Chicago |
10:48 | kados | thd: done |
10:50 | thd | kados: so my proposal changes the order given for topical subjects with topical subdivisions which are subdivided geographically |
10:51 | kados: the actual string is merely a convention and should be malleable for taxonomy purposes as long as the meaning does not change | |
10:52 | tumer | kados: my professor tells me you can not change the order as you wish if its created as $a $z$x it has to stay that way |
10:52 | thd | tumer: is the problem for separate holdings that you cannot bring multiple databases in the same result set? |
10:52 | tumer | it has a meaning on stress |
10:52 | hi thd | |
10:52 | thd | tumer: hello |
10:52 | dewey | hi, thd |
10:52 | tumer | thd:yes |
10:53 | kados | tumer: yes, I'm not proposing to change order ... just to nest each element properly |
10:53 | tumer | the order is the nesting in way |
10:53 | $a alwys being first | |
10:54 | $x$y$z change places | |
10:54 | thd | tumer: why not put them in the same database in separate records and search by 000/06 to identify the holdings records |
10:54 | ? | |
10:54 | kados | tumer: so your prof chooses #4? |
10:54 | tumer | thd: how do you search some title belonging to some branch? |
10:55 | shedges | kados: I'd say: |
10:55 | $a Architecture | |
10:55 | |->$x History | |
10:55 | |->$z Illinois | |
10:55 | |->$z Chicago | |
10:55 | |->$v Pictorial Works. | |
10:56 | (based on MODS, I guess) | |
10:56 | http://www.loc.gov/standards/m[…]line.html#subject | |
10:56 | tumer | kados: number 4??? |
10:57 | kados | tumer: http://kados.org/hierarchy.txt |
10:57 | tumer: show that to your prof, see what he/she says :-) | |
10:57 | shedges: looking now | |
10:58 | thd | tumer you search for holdings by finding 000/06, please excuse the PHP syntax it is easy to convert to Perl |
10:58 | tumer | shedges's looks right |
10:58 | thd | } elseif ($tag000_pos06 == 'u' || $tag000_pos06 == 'v' || $tag000_pos06 == 'x' || $tag000_pos06 == 'y') { |
10:58 | $recFormat = 'holdings format'; | |
10:58 | if ($tag000_pos06 == 'u') { | |
10:58 | $recType = 'unknown'; | |
10:58 | } elseif ($tag000_pos06 == 'v') { | |
10:58 | $recType = 'multipart item holdings'; | |
10:58 | } elseif ($tag000_pos06 == 'x') { | |
10:58 | $recType = 'single-part item holdings'; | |
10:58 | } elseif ($tag000_pos06 == 'y') { | |
10:58 | $recType = 'serial item holdings'; | |
10:58 | } | |
11:01 | kados | tumer: do you know how to index the leader in zebra? |
11:01 | tumer: if i ever knew I forgot | |
11:01 | tumer | yes |
11:01 | use LDR instead of number | |
11:02 | thd | shedges: is the topical subdivision no more important than the geographic subdivision in a topical subject? |
11:03 | tumer | thd: i am still not sure what u are saying. 2 separate marc records, how do you merge serach them? |
11:03 | kados:i am not very sure now | |
11:03 | kados | shedges: I'm not sure the MODS description is positing an order |
11:04 | tumer: :-) | |
11:04 | shedges | right -- I doubt that it is |
11:04 | I was just trying to cheat! | |
11:05 | tumer | well according to prof lady no4 looks ok but she is not used to see them like that |
11:06 | thd | tumer: well you search for records matching your criteria which are either holdings or bibliographic records and sort them out when you get the results back if you want both or limit the search if you only want one |
11:06 | tumer | they are used to seeing them all in one line in LoC |
11:06 | kados | hehe |
11:06 | thd | if ($targetSyntax == 'MARC 21') { |
11:06 | if ($tag000_pos06 == 'a' || $tag000_pos06 == 'c' || $tag000_pos06 == 'd' || $tag000_pos06 == 'e' || $tag000_pos06 == 'f' || $tag000_pos06 == 'g' || $tag000_pos06 == 'i' || $tag000_pos06 == 'j' || $tag000_pos06 == 'k' || $tag000_pos06 == 'm' || $tag000_pos06 == 'o' || $tag000_pos06 == 'p' || $tag000_pos06 == 'r' || $tag000_pos06 == 't') { | |
11:06 | $recFormat = 'bibliographic format'; | |
11:07 | kados | thd: what is this for? |
11:07 | thd: detecting the format? | |
11:07 | tumer | thd is trying to solve merge issue on the fly |
11:07 | thd | kados: yes and material type etc. |
11:07 | kados | cool |
11:08 | in zebra | |
11:08 | tumer | where is branch where is shelf? |
11:08 | thd | tumer: my z39.50 client has no other choice but to solve merge issues on the fly |
11:08 | kados | the problem is, depending on the leader field, the positions change in 008 |
11:08 | thd | kados: I have code for that |
11:09 | tumer | search on a title i get 1000 results 100 of those are in a specific branch and i want those |
11:09 | bibliograhic record does not hold branch or shelving info i thought | |
11:10 | thd | tumer: use the branch as an additional search criteria as part of the query by inclusion or exclusion and you may need post processing if by inclusion. |
11:11 | tumer: yes the bibliographic record may have no holdings so then you have to search for linked holdings | |
11:12 | tumer: by specifying a search for 003 in holdings records taken from 001 in bibliographic records | |
11:12 | tumer | well if they are separate records and i search title=sometithing branch=MAIN i am bound to get 0 resuts |
11:13 | thd:004 i presume | |
11:13 | and it could take houlf hor doing that i tried | |
11:13 | s/hor/hour | |
11:14 | thd | tumer: what could take half an hour? |
11:14 | tumer | once i receive bibliographic records to recursively search holdings |
11:16 | thd | tumer: how long if you run two separate searches and merge them with Perl instead of asking Zebra to do the work? |
11:16 | tumer | for that i have to receive all the records and that is sometimes in 10,000+ |
11:17 | thd | tumer: yes that is a problem for a large result set |
11:18 | tumer | well results upto 20-50 is bearable and i do it |
11:19 | thd | tumer: certainly it can be done for results one page at a time but it should be reasonable to do for an arbitrary result set. |
11:20 | s/it should be/there should be a method/ | |
11:22 | tumer: the problem is that we need a way to create a common index for both holdings and bibliographic records while keeping the records separate | |
11:23 | tumer | why do we need that? |
11:23 | thd | tumer: so you can search both as if they were in one record |
11:23 | tumer: for speed | |
11:24 | tumer | zebra does not work that way |
11:24 | a record is a record separate entity | |
11:25 | how will zebra know which holdings belong to which biblio to join them | |
11:25 | thd | tumer: you have found that there is no way to cross index multiple databases or is the problem of combining the results from multiple databases in a single results set |
11:25 | ? | |
11:27 | tumer | thd: if i search on common indexes like biblionumber i retrive both biblio and the related holdings |
11:27 | whether in same or differnt databases | |
11:27 | the problem still persists... | |
11:27 | thd | tumer: what then is the problem? |
11:27 | dewey | then the problem is, like, that you still want a unique call number for all the material that you have in the collection |
11:28 | thd | tumer: is the problem what dewey said? |
11:29 | tumer: I hope that you do not have the problem that dewey was repeating. | |
11:34 | tumer: is the problem that you have Zebra indexing by field instead of by record type so that Zebra is not matching 001 in a bibliographic record to 004 in a holdings record for indexing purposes? | |
11:37 | tumer: if that is the problem, one of a general kind for Zebra, which I feared some months ago. If Zebra will not change then we need to build at least our own meta-indexing system which uses Zebra for at least storage if not some indexing. | |
11:40 | tumer | thd:correct |
11:41 | thd | tumer: my hope is that Index Data would fix the obstacle first. |
11:41 | tumer | i hope |
11:42 | thd | tumer: yet, I do not think that building a complimentary index would be difficult but I have not thought it through fully. |
11:44 | tumer: I think it quite simple now | |
11:45 | tumer: but much less efficient than if Zebra does it all. | |
11:48 | tumer: have you or anyone asked Index Data about this indexing design problem specifically? | |
11:50 | tumer: this will be a much worse problem for doing anything interesting with authorities will it not? | |
11:52 | tumer: I mean for authorities we would have the same problem but not even have the option of authorities data for tracings and references fields, etc. in the bibliographic records in the way that we have holdings fields in the bibliographic records. | |
11:55 | tumer: so who has asked Index Data about more flexible use of indexes for matching different record types or other similar purposes? | |
11:56 | kados: are you paying attention? |
← Previous day | Today | Next day → | Search | Index