Terrier Users :  Terrier Forum terrier.org
General discussion about using/developing applications using Terrier 
Could not retrieve FIRE data
Posted by: hardik
Date: December 20, 2011 07:18AM

Hi,

I have a question: does the topics file need to have a .txt or a .gz extension?

I am new to Terrier and am trying ad hoc (batch) retrieval on the FIRE 2011 data for the Gujarati language. I got the following error:

[root@agni bin]# ./trec_terrier.sh -r
Setting TERRIER_HOME to /home/hardik/terrier/terrier-3.5
Setting JAVA_HOME to /usr
INFO - Loading document lengths for document structure into memory
INFO - Structure meta reading lookup file into memory
INFO - Structure meta reading reverse map for key docno directly from disk
INFO - Structure meta loading data file into memory
INFO - time to intialise index : 0.272
ERROR - Error instantiating topic file QuerySource called TRECQuery
java.lang.reflect.InvocationTargetException
at sun.reflect.NativeConstructorAccessorImpl.newInstance0(Native Method)
at sun.reflect.NativeConstructorAccessorImpl.newInstance(NativeConstructorAccessorImpl.java:57)
at sun.reflect.DelegatingConstructorAccessorImpl.newInstance(DelegatingConstructorAccessorImpl.java:45)
at java.lang.reflect.Constructor.newInstance(Constructor.java:532)
at org.terrier.applications.TRECQuerying.getQueryParser(TRECQuerying.java:797)
at org.terrier.applications.TRECQuerying.<init>(TRECQuerying.java:344)
at org.terrier.applications.TrecTerrier.run(TrecTerrier.java:393)
at org.terrier.applications.TrecTerrier.applyOptions(TrecTerrier.java:564)
at org.terrier.applications.TrecTerrier.main(TrecTerrier.java:235)
Caused by: java.lang.NullPointerException
at org.terrier.structures.TRECQuery.extractQuery(TRECQuery.java:163)
at org.terrier.structures.TRECQuery.extractQuery(TRECQuery.java:87)
at org.terrier.structures.TRECQuery.<init>(TRECQuery.java:272)
... 9 more
A problem occurred: java.lang.NullPointerException
java.lang.NullPointerException
at org.terrier.applications.TRECQuerying.processQueries(TRECQuerying.java:829)
at org.terrier.applications.TrecTerrier.run(TrecTerrier.java:394)
at org.terrier.applications.TrecTerrier.applyOptions(TrecTerrier.java:564)
at org.terrier.applications.TrecTerrier.main(TrecTerrier.java:235)

Re: Could not retrieve FIRE data
Posted by: craigm
Date: January 06, 2012 05:05PM

> Whether the topics file must be with .txt or .gz extension ?

Terrier can decompress .gz automatically.

I think the problem is that your topics file doesn't have a <NUM> tag, or whatever tag TrecQueryTags.idtag is set to.

Can you post a sample of your topics file, and the TrecQueryTags properties from your terrier.properties?

Thanks

Craig
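
For anyone who lands here with the same NullPointerException, a rough way to check a topics file before running Terrier is sketched below in Python. It is not part of Terrier and does not reproduce Terrier's own parser. The function names `read_topics` and `check_topic_ids` and the default tag names `top` and `num` are assumptions for illustration; substitute whatever TrecQueryTags.doctag and TrecQueryTags.idtag are set to in your terrier.properties.

```python
import gzip
import re


def read_topics(path):
    """Read a topics file as text; a .gz extension is decompressed on the fly."""
    opener = gzip.open if path.endswith(".gz") else open
    with opener(path, "rt", encoding="utf-8") as handle:
        return handle.read()


def check_topic_ids(text, doc_tag="top", id_tag="num", case_sensitive=False):
    """Return (number of topic blocks, positions of blocks lacking an id tag).

    doc_tag and id_tag are assumed names; use the values configured for
    TrecQueryTags.doctag and TrecQueryTags.idtag. Positions are 0-based.
    """
    flags = re.DOTALL if case_sensitive else re.DOTALL | re.IGNORECASE
    # A topic block is <doc_tag ...> ... </doc_tag>; attributes are tolerated.
    block_re = re.compile(
        r"<%s(?:\s[^>]*)?>(.*?)</%s\s*>" % (re.escape(doc_tag), re.escape(doc_tag)),
        flags,
    )
    # The id tag only needs to open somewhere inside the block.
    id_re = re.compile(r"<%s(?:\s[^>]*)?>" % re.escape(id_tag), flags)
    blocks = block_re.findall(text)
    missing = [i for i, body in enumerate(blocks) if not id_re.search(body)]
    return len(blocks), missing
```

Calling `check_topic_ids(read_topics("topics.gz"))` on a hypothetical file returns the block count and the list of blocks without an id tag. A count of zero suggests that the document tag itself was not matched, for example because of a case mismatch between the file and the configured tag name.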

Re: Could not retrieve FIRE data
Posted by: GARIMA NATANI
Date: June 19, 2012 05:35PM

I am working with Terrier 3.0, because I get an error when I try to index the FIRE data set with Terrier 3.5. I am trying ad hoc (batch) retrieval on the FIRE 2011 data for the English language, and I got the following error:


root@mnitjaipur:~/Desktop/work/terrier-3.0$ ./bin/trec_terrier.sh -r -c 1.0
Setting TERRIER_HOME to /home/xyz/Desktop/work/terrier-3.0
Setting JAVA_HOME to /usr
INFO - Loading document lengths for document structure into memory
INFO - Structure meta reading lookup file into memory
INFO - Structure meta reading reverse map for key docno directly from disk
INFO - Structure meta loading data file into memory
INFO - time to intialise index : 0.128
ERROR - Error instantiating topic file tokeniser called TRECQuery
java.lang.reflect.InvocationTargetException
at sun.reflect.NativeConstructorAccessorImpl.newInstance0(Native Method)
at sun.reflect.NativeConstructorAccessorImpl.newInstance(NativeConstructorAccessorImpl.java:57)
at sun.reflect.DelegatingConstructorAccessorImpl.newInstance(DelegatingConstructorAccessorImpl.java:45)
at java.lang.reflect.Constructor.newInstance(Constructor.java:532)
at org.terrier.applications.TRECQuerying.getQueryParser(TRECQuerying.java:732)
at org.terrier.applications.TRECQuerying.<init>(TRECQuerying.java:297)
at org.terrier.applications.TrecTerrier.run(TrecTerrier.java:401)
at org.terrier.applications.TrecTerrier.applyOptions(TrecTerrier.java:573)
at org.terrier.applications.TrecTerrier.main(TrecTerrier.java:237)
Caused by: java.lang.NullPointerException
at org.terrier.structures.TRECQuery.extractQuery(TRECQuery.java:156)
at org.terrier.structures.TRECQuery.<init>(TRECQuery.java:199)
... 9 more
A problem occurred: java.lang.NullPointerException
java.lang.NullPointerException
at org.terrier.applications.TRECQuerying.processQueries(TRECQuerying.java:768)
at org.terrier.applications.TrecTerrier.run(TrecTerrier.java:402)
at org.terrier.applications.TrecTerrier.applyOptions(TrecTerrier.java:573)
at org.terrier.applications.TrecTerrier.main(TrecTerrier.java:237)

Re: Could not retrieve FIRE data
Posted by: craigm
Date: June 19, 2012 06:36PM

This is a parsing problem. The tokeniser didn't find an "id" tag for your queries. Do you have the TrecQueryTags properties set correctly? Can you show a sample of your topics?

Craig
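
For reference, the TrecQueryTags properties mentioned above are set in terrier.properties. The fragment below is only a sketch of what they might look like for TREC-style topics that use TOP, NUM, TITLE, DESC and NARR tags; the values are assumptions to adapt to your own topics file, not a confirmed fix for the error in this thread.

```properties
# Tag that delimits a single topic
TrecQueryTags.doctag=TOP
# Tag that holds the query id
TrecQueryTags.idtag=NUM
# Tags whose content is read
TrecQueryTags.process=TOP,NUM,TITLE
# Tags whose content is ignored
TrecQueryTags.skip=DESC,NARR
```

Whether tag names are matched case-sensitively may depend on the Terrier version, so it is worth checking the documentation for your release if the topics file uses lowercase tags.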

Re: Could not retrieve FIRE data
Posted by: GARIMA NATANI
Date: June 21, 2012 07:12AM

This is a sample of my topics file.

<topics>

<top lang='en'>
<num>126</num>
<title>Swine flu vaccine</title>
<desc>Indigenous vaccine made in India for swine flu prevention</desc>
<narr>Relevant documents should contain information related to making indigenous swine flu vaccines in India, the vaccine's use on humans and animals, arrangements that are in place to prevent scarcity / unavailability of the vaccine, and the vaccine's role in saving lives.</narr>
</top>

<top lang='en'>
<num>127</num>
<title>Rare cosmic events</title>
<desc>Information about rare solar eclipses, lunar eclipses, meteor showers, changes in position of stellar bodies with respect to Earth</desc>
<narr>Relevant documents should describe the reactions from people interested in these events. Reasons why the events are rare, and the difference in opinion of astrologers and scientists, is relevant. Space crafts, existence of extra-terrestrial life are irrelevant and information on space research is irrelevant. Events that are not designated as rare are not of interest.</narr>
</top>
.
.
.
.
.
<top lang='en'>
<num>175</num>
<title>Sachin Tendulkar's record of runs in Test Cricket</title>
<desc>Sachin Tendulkar's record of runs in Test Cricket</desc>
<narr>Relevant documents should contain information about the records Sachin Tendulkar has broken including his own, in Test Cricket. Recent, new records he has created are also relevant. One-day Cricket and T-20 cricket are not relevant in this context.</narr>
</top>

</topics>



Edited 1 time(s). Last edit at 06/21/2012 10:02AM by GARIMA NATANI.

Re: Could not retrieve FIRE data
Posted by: GARIMA NATANI
Date: June 26, 2012 05:25PM

The problem is solved.



Edited 1 time(s). Last edit at 07/02/2012 08:52AM by GARIMA NATANI.

Re: Could not retrieve FIRE data
Posted by: craigm
Date: July 23, 2012 07:15PM

Any hints for future users?

Craig

Re: Could not retrieve FIRE data
Posted by: bharathi
Date: October 24, 2016 06:36AM

Hi,

I am also having the same problem. Can you guide me?

Thanks in advance.


