[Isis-users] Authority data base

Edwin Hübner edwin.hubner at gmail.com
Fri Aug 2 17:53:13 CEST 2019


Hi Egbert,
I agree with you, but who should add it to the ABCD wiki? Can you do it?
Regards,
Edwin Hübner
(5521)99647-1675


On Fri, Aug 2, 2019 at 04:25, Egbert De Smet <
egbert.desmet at uantwerpen.be> wrote:

> Edwin,
>
>
> well, it took some tedious and systematic analysis to finally get there,
> but I am very glad we managed.
>
> Maybe the ABCD wiki page on this technique should mention that if the
> 'extract as' part doesn't work, it returns the 'list as' value. I think this
> is already mentioned in the general description of 'list as' and 'extract
> as', but not for this special technique with the more advanced use of a PFT
> in the 'list as' part.
>
> I am also glad that once again we proved that ABCD is capable of dealing
> with any situation...
>
>
> Egbert de Smet
> Universiteit Antwerpen
>
>
> ------------------------------
> *From:* Edwin Hübner <edwin.hubner em gmail.com>
> *Sent:* Thursday, August 1, 2019 9:18 PM
> *To:* Egbert De Smet
> *Cc:* Maria Mercedes MacLean; isis-users
> *Subject:* Re: [Isis-users] Authority data base
>
> Yes, Egbert, it works with "case 400: v400*4'$$$'v100[1]"
> I am very grateful for your patience in helping me to solve this "mystery".
> Thank you very much.
> Edwin Hübner
> (5521)99647-1675
>
>
> On Thu, Aug 1, 2019 at 14:01, Egbert De Smet <
> egbert.desmet at uantwerpen.be> wrote:
>
>> Edwin,
>>
>>
>> then I think this is the problem : maybe the mechanism is trying to
>> retrieve the same occurrence (e.g. 2nd) of v100 and when failing returns
>> v400 itself.
>>
>> So then try to put v100[1] after the '$$$' as 'extract as' PFT. That
>> specifically instructs to get the 1st occurrence of the v100 field.
>>
>> Well, a simple experiment to find out.
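The hypothesis above can be illustrated with a small Python toy model. This is only a sketch of the behaviour Egbert guesses at, not ABCD's actual PHP or IsisScript code; the function name `resolve`, the `force_first` flag and the simplified record layout are assumptions made for illustration.

```python
def resolve(record, list_tag, extract_tag, clicked_occ, force_first=False):
    """Return the value that would be transferred into the record.

    record       -- dict mapping a field tag to its list of occurrences
    clicked_occ  -- 1-based occurrence of the 'list as' field that was clicked
    force_first  -- models writing v100[1] instead of v100 after the '$$$'
    """
    listed = record[list_tag][clicked_occ - 1]
    # Hypothesis: the mechanism looks for the SAME occurrence number
    # in the 'extract as' field, unless an explicit [1] is given.
    wanted = 1 if force_first else clicked_occ
    candidates = record.get(extract_tag, [])
    if wanted <= len(candidates):
        return candidates[wanted - 1]
    # Hypothesis: when that occurrence does not exist, the 'list as'
    # value is returned instead.
    return listed


# Simplified version of the example record quoted in this thread.
record = {
    100: ["Aguilar, José Roberto, 1941-"],
    400: ["José Roberto Aguilar.", "Aguilar."],
}
```

With this model, clicking the second occurrence of 400 gives back "Aguilar." because there is no second occurrence of 100, while `force_first=True` (the `v100[1]` variant) gives the 100 heading.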
>>
>>
>> Egbert de Smet
>> Universiteit Antwerpen
>>
>>
>> ------------------------------
>> *From:* Edwin Hübner <edwin.hubner em gmail.com>
>> *Sent:* Thursday, August 1, 2019 6:44 PM
>> *To:* Maria Mercedes MacLean
>> *Cc:* Egbert De Smet; isis-users
>> *Subject:* Re: [Isis-users] Authority data base
>>
>> Hi Maria Mercedes and Egbert,
>> Yes, I have changed the fst of 400 to
>> '/AU_/',(v400^a,| |v400^b,| |v400^c,| |v400^d,| - |v400^x,'%'/),
>>
>> Field 100 always has only one occurrence, as in this example:
>> 100 ®1#^aAguilar, José Roberto, ^d1941-^xAcervo¯
>> 400 ®1#^aJosé Roberto Aguilar.¯
>> 400 ®##^aAguilar.¯
>> Regards,
>> Edwin Hübner
>> (5521)99647-1675
>>
>>
>> On Wed, Jul 31, 2019 at 19:45, Maria Mercedes MacLean <
>> gaspyy at infovia.com.ar> wrote:
>>
>>> Edwin, I am following you and Egbert…
>>>
>>> Is the FST still like this?
>>>
>>> 100 5 mpu,if p(v100^a) then '/AU_/',v100^a,| |v100^b,| |v100^c,|
>>> |v100^d,| - |v100^x,'%',fi,
>>>
>>>
>>>
>>> 400 5 mpu,if p(v400^a) then '/AU_/',(v400^a,| |v400^b,| |v400^c,|
>>> |v400^d,| - |v400^x|%|/),fi,
>>>
>>>
>>>
>>> I don't understand whether you changed anything…
>>>
>>> Thanks!!!
>>>
>>> Greetings to you both.
>>>
>>> María Mercedes MacLean
>>>
>>> gaspyy60 em gmail.com
>>>
>>> mamemaclean em gmail.com
>>>
>>> gaspyy2002 em yahoo.com.ar
>>>
>>> Visit our projects
>>>
>>> www.proyectoleonardo.com.ar/proleon
>>>
>>>
>>>
>>>
>>>
>>>
>>> ------------------------------
>>>
>>> *From:* isis-users [mailto:isis-users-bounces+gaspyy=
>>> infovia.com.ar em iccisis.org] *On behalf of* Edwin Hübner
>>> *Sent:* Wednesday, July 31, 2019 06:58 p.m.
>>> *To:* Egbert De Smet
>>> *Cc:* isis-users
>>> *Subject:* Re: [Isis-users] Othority data base
>>>
>>>
>>>
>>> Hi Egbert,
>>>
>>> Based on your explanation, I found out that there really was a problem
>>> with my fst. At the end of the line I had ^x|%|, so the % would only be
>>> output if a ^x exists. I changed it to ^x,'%' and now the output of the
>>> command mx author fst=@ shows ^o1 and ^o2, as you can see:
>>>
>>> 100  «AU_AGUILAR, JOSÚ ROBERTO,  1941- - ACERVO^m10^o1^c1^l2»
>>> 400  «AU_JOSÚ ROBERTO AGUILAR.^m10^o1^c1^l2»
>>> 400  «AU_AGUILAR.^m10^o2^c1^l1»
>>>
>>> But it didn't solve the problem. When I click on Aguilar (the second
>>> occurrence of field 400), it assigns the same Aguilar and not
>>> Aguilar, José Roberto ...
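The FST change described in this message swaps a conditional suffix literal (`v400^x|%|`) for an unconditional one (`v400^x,'%'`). A minimal Python sketch of that difference follows; the two function names are hypothetical, and the model covers a single subfield of a single occurrence only, not the full CISIS formatting language.

```python
def conditional_suffix(value, suffix):
    """Models v400^x|%| : the suffix is emitted only when the subfield has a value."""
    if value:
        return value + suffix
    return ""


def unconditional_literal(value, literal):
    """Models v400^x,'%' : the literal is emitted whether or not the subfield exists."""
    return (value or "") + literal


# An occurrence that has a ^x subfield and one that does not.
with_x = "Acervo"
without_x = None
```

In this model an occurrence without ^x produces no "%" at all with the conditional form, which matches Edwin's observation that the "%" was only present when ^x existed.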
>>>
>>> Edwin Hübner
>>> (5521)99647-1675
>>>
>>>
>>>
>>>
>>>
>>> On Wed, Jul 31, 2019 at 11:29, Egbert De Smet <
>>> egbert.desmet at uantwerpen.be> wrote:
>>>
>>> Edwin,
>>>
>>>
>>>
>>> according to your output under here, both occurrences are stored as
>>> occurrence 1 (^o1)... so that could explain your problem.
>>>
>>> Are you sure you are indexing with the line
>>>
>>>      400 5 mpu,if p(v400^a) then '/AU_/',(v400^a,| |v400^b,| |v400^c,|
>>> |v400^d,| - |v400^x|%|/),fi,
>>>
>>> ?
>>>
>>> Anyway it marks both occurrences as occ1.
>>>
>>>
>>>
>>> What is your full FST ?
>>>
>>>
>>>
>>> Egbert de Smet
>>> Universiteit Antwerpen
>>>
>>>
>>> ------------------------------
>>>
>>> *From:* Edwin Hübner <edwin.hubner em gmail.com>
>>> *Sent:* Wednesday, July 31, 2019 3:44 PM
>>> *To:* Egbert De Smet
>>> *Cc:* Soraya Ferraz Lima; isis-users
>>> *Subject:* Re: [Isis-users] Othority data base
>>>
>>>
>>>
>>>
>>> It seems to be correct. I used the command mx autor fst=@ and an
>>> example of fields 100 and 400, which has two occurrences, is as follows:
>>>
>>> 100  «AU_AGUILAR, JOSÚ ROBERTO,  1941- - ACERVO^m10^o1^c1^l2»
>>> 400  «AU_JOSÚ ROBERTO AGUILAR.^m10^o1^c1^l2»
>>> 400  «AU_AGUILAR.^m10^o1^c2^l1»
>>>
>>> Thanks,
>>>
>>> Edwin Hübner
>>> (5521)99647-1675
>>>
>>>
>>>
>>>
>>>
>>> On Wed, Jul 31, 2019 at 10:13, Egbert De Smet <
>>> egbert.desmet at uantwerpen.be> wrote:
>>>
>>> Edwin,
>>>
>>>
>>>
>>> then most probably the script already puts the PFT in a loop, creating a
>>> 'nested repeatable format' which results in the known 'semantic error' of
>>> wxis (CISIS).
>>>
>>> You could test your records in the terminal with mx to check whether the
>>> ID and OCC are stored correctly, e.g. if your database is named 'test' and
>>> you have a FST 'test.fst' you could use the following command :
>>>
>>> mx test fst=@
>>>
>>> without anything more it would present each record one by one just on
>>> the screen with first the listing of the field-contents, followed by the
>>> FST-output, which looks like e.g. :
>>>
>>> mfn=     1
>>>   1  "1"
>>>   1  "2"
>>>   1  "V1_1^m1^o1^c1^l1"
>>>   1  "V1_2^m1^o2^c1^l1"
>>>  11  "V11_1^m1^o1^c1^l1"
>>>  11  "V11_2^m1^o2^c1^l1"
>>> ..
>>>
>>> This is a single record with 2 occurrences of v1 with resp. values 1 and
>>> 2, but indexed with two different ID's (1 and 11 and accordingly different
>>> prefixes). The interesting part is ^m1^o2^c1^l1 which indicates in ^o the
>>> occurrence counter, in this case 2.
>>>
>>> So now please check if your author-names have different occurrence
>>> counters reflecting their position in v100 and v400. If they are correct
>>> then it might be a bug indeed of the '$$$' extract-as technique with the
>>> autoridades.pft, because then ABCD should be able to correctly extract the
>>> same occurrence of v100 as the one read (listed as) of v400.
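The check described above can be scripted. The following Python sketch reads lines in the shape quoted in this thread and collects the ^o counter per FST ID; the regular expression and the function names are assumptions based only on the sample output shown here, and ^m is assumed to be the MFN.

```python
import re

# One FST-output line as quoted in this thread, e.g.
#   400  «AU_AGUILAR.^m10^o2^c1^l1»
# The quote characters vary (« » or " "), so any single non-space
# character is accepted as the opening and closing delimiter.
LINE_RE = re.compile(
    r"^\s*(?P<id>\d+)\s+\S(?P<key>.*)"
    r"\^m(?P<m>\d+)\^o(?P<o>\d+)\^c(?P<c>\d+)\^l(?P<l>\d+)\S\s*$"
)


def parse_fst_line(line):
    """Return a dict for an FST-output line, or None for any other line."""
    match = LINE_RE.match(line)
    if match is None:
        return None
    return {
        "id": int(match.group("id")),
        "key": match.group("key"),
        "mfn": int(match.group("m")),   # assumption: ^m carries the MFN
        "occ": int(match.group("o")),   # occurrence counter, per the text above
    }


def occurrences_by_id(lines):
    """Map each FST ID to the list of occurrence counters found for it."""
    result = {}
    for line in lines:
        parsed = parse_fst_line(line)
        if parsed is not None:
            result.setdefault(parsed["id"], []).append(parsed["occ"])
    return result


sample = [
    '  1  "1"',
    '  1  "V1_1^m1^o1^c1^l1"',
    '  1  "V1_2^m1^o2^c1^l1"',
    ' 11  "V11_1^m1^o1^c1^l1"',
    ' 11  "V11_2^m1^o2^c1^l1"',
]
```

If every ID of a repeatable field shows the counters 1, 2, ... in order, the occurrences are being indexed separately; a list such as 1, 1 would point to the FST problem discussed later in the thread.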
>>>
>>>
>>>
>>> Egbert de Smet
>>> Universiteit Antwerpen
>>>
>>>
>>> ------------------------------
>>>
>>> *From:* Edwin Hübner <edwin.hubner em gmail.com>
>>> *Sent:* Wednesday, July 31, 2019 3:01 PM
>>> *To:* Egbert De Smet
>>> *Cc:* Soraya Ferraz Lima; isis-users
>>> *Subject:* Re: [Isis-users] Othority data base
>>>
>>>
>>>
>>> Dear Egbert,
>>>
>>> Yes, I had already tested this alternative before, using the following
>>> in autoridades.pft:
>>>
>>> case 400: (v400*4/)'$$$'(v100/)  or
>>>
>>> case 400: (v400*4/)'$$$'v100
>>>
>>> but unfortunately it produces the following error:
>>>
>>> WXIS|fatal error|unavoidable|fmt_gener/semantic_error|
>>>
>>>
>>>
>>> Regards,
>>>
>>> Edwin Hübner
>>> (5521)99647-1675
>>>
>>>
>>>
>>>
>>>
>>> On Wed, Jul 31, 2019 at 09:19, Egbert De Smet <
>>> egbert.desmet at uantwerpen.be> wrote:
>>>
>>> Edwin,
>>>
>>>
>>>
>>> OK, then your 'occurrence counter' must be correctly stored in your IF,
>>> so that doesn't explain the problem.
>>>
>>> Maybe you could try (I don't have good testing material at hand here)
>>> using brackets to make the field be treated as a repeatable
>>> field :
>>>
>>> case 400: (v400*4/)'$$$'(v100/)
>>>
>>> so as to force both the 'list as' and 'extract as' values to come as
>>> separate entries.
>>>
>>> I don't see any examples of such repeated groups in the series of
>>> 'autoridades.pft' in the ABCD-demo databases, but in the ABCDWIKI pages on
>>> this issue (
>>> http://abcdwiki.net/wiki/es/index.php?title=Control_de_terminolog%C3%ADa:_Listas_de_autoridades)
>>> I can see that it simply refers to 'the format to be used to transfer the
>>> field in the record' (el formato a utilizar para transferir el campo al
>>> registro (extraer como)), so in principle a repeatable group should be
>>> allowed and should ensure that an occurrence counter is followed, as
>>> that seems to be the problem.
>>>
>>> I have no idea whether it would make a meaningful difference but it's
>>> worth a try. Otherwise Guilda Ascencio should be called to the rescue
>>> to check her implementation in the PHP-scripts (or the IsisScript).
>>>
>>>
>>>
>>> By the way, the FST examples of the WIKI-page use the method  '0' for
>>> the indexing but that should not create any difference in the resulting
>>> strings :
>>>
>>> 400  0 mpu,(|AU_|v400^*,|, |V400^b,'%'/)/
>>>
>>>
>>>
>>>
>>>
>>>
>>>
>>> Egbert de Smet
>>> Universiteit Antwerpen
>>>
>>>
>>> ------------------------------
>>>
>>> *From:* Edwin Hübner <edwin.hubner em gmail.com>
>>> *Sent:* Wednesday, July 31, 2019 1:08 PM
>>> *To:* Soraya Ferraz Lima
>>> *Cc:* Egbert De Smet; isis-users
>>> *Subject:* Re: [Isis-users] Othority data base
>>>
>>>
>>>
>>> Hi Egbert,
>>>
>>> My fst for fields 100 and 400 is as follows:
>>>
>>> 100 5 mpu,if p(v100^a) then '/AU_/',v100^a,| |v100^b,| |v100^c,|
>>> |v100^d,| - |v100^x,'%',fi,
>>>
>>> 400 5 mpu,if p(v400^a) then '/AU_/',(v400^a,| |v400^b,| |v400^c,|
>>> |v400^d,| - |v400^x|%|/),fi,
>>>
>>>
>>>
>>> Remember that when I click on a name that is the first occurrence of
>>> field 400, it assigns the name of field 100 (as expected), but if the name
>>> that is clicked is another occurrence, it assigns field 400 instead of 100.
>>>
>>>
>>>
>>> Thanks,
>>>
>>> Edwin Hübner
>>> (5521)99647-1675
>>>
>>>
>>>
>>>
>>>
>>> On Wed, Jul 31, 2019 at 07:12, Soraya Ferraz Lima <
>>> soyferrazlima at gmail.com> wrote:
>>>
>>> Hi Edwin!
>>>
>>>
>>>
>>> Here we use it like this:
>>>
>>>
>>>
>>> Case 400: v400^a, '$$$', v700
>>>
>>>
>>>
>>> The part before the $$$ is what is listed, and the part after the $$$ is
>>> how it will be stored.
>>>
>>>
>>>
>>> Fst 400 0 |AU_|v400^a
>>>
>>>
>>>
>>> At least that is the way I understand it.
>>>
>>> On Tuesday, July 30, 2019, Edwin Hübner <
>>> edwin.hubner at gmail.com> wrote:
>>>
>>> Thank you Egbert for your interest in helping. Sorry I didn't explain it
>>> more clearly because it was clear to me.
>>>   Please see further explanation in the attached document.
>>>
>>> Regards,
>>>
>>> Edwin Hübner
>>> (5521)99647-1675
>>>
>>>
>>>
>>>
>>>
>>> On Tue, Jul 30, 2019 at 03:45, Egbert De Smet <
>>> egbert.desmet at uantwerpen.be> wrote:
>>>
>>> Edwin,
>>>
>>>
>>>
>>> sorry, but your problem and request are not that clear to me. That is
>>> probably also why no reply came...
>>>
>>> Can you please explain the exact situation : are you clicking on an
>>> occurrence in a PFT (a hyperlink ?), in a picklist of a worksheet or what
>>> exactly do you want to get ?
>>>
>>> But before everything : please note that v100 * 4 should actually be
>>> v100*4 (no spaces in between). Perhaps that causes problems.
>>>
>>>
>>>
>>> Egbert de Smet
>>> Universiteit Antwerpen
>>>
>>>
>>> ------------------------------
>>>
>>> *From:* isis-users <isis-users-bounces+egbert.desmet=
>>> ua.ac.be em iccisis.org> on behalf of Edwin Hübner <edwin.hubner em gmail.com>
>>> *Sent:* Monday, July 29, 2019 1:19 PM
>>> *To:* isis-users
>>> *Subject:* [Isis-users] Othority data base
>>>
>>>
>>>
>>> Dear,
>>>
>>> I have asked about this problem before, but didn't receive help. So I'm
>>> asking again:
>>>
>>>
>>> I have a MARC bibliographic database and an authority database for
>>> normalizing author names. The problem is: when field 400 (See tracing),
>>> for example, has more than one occurrence and I click on a name other
>>> than the first occurrence, ABCD does not assign the name of field 100, as
>>> expected, but the name of field 400 itself. I wonder if anyone has had
>>> this problem and how they solved it?
>>>
>>>
>>> My autoridades.pft is as follows:
>>> select e3
>>> case 100: v100 * 4 '$$$' v100
>>> case 110: v110 * 4 '$$$' v110
>>> case 111: v111 * 4 '$$$' v111
>>> case 400: v400 * 4 '$$$' v100
>>> case 410: v410 * 4 '$$$' v110
>>> case 411: v411 * 4 '$$$' v111
>>> endsel
>>> Thanks for any help.
>>>
>>> Edwin Hübner
>>> (5521)99647-1675
>>>
>>>