lucene


 

"Re: lucene datatypes" posted by ~Ray
Posted on 2008-11-13 12:19:33

Solr provides semantics on top of Lucene fields for handling other data types, and there are some tools (DateTools, NumberTools) for converting some types to Strings for searching. But yeah, Strings are pretty much the only thing Lucene cares about when it comes to searching.

-Grant

On Nov 15, 2007, at 2:24 AM, Heba Farouk wrote:

> Hello,
>
> I would like to ask how Lucene handles different datatypes, or is "String" the only available datatype?
>
> Best regards,
>
> Heba Farouk
> Software Engineer
> Bibliotheca Alexandrina
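Because searching compares Strings, a number has to be written in a form whose lexicographic order matches its numeric order. The following is a minimal sketch of that idea in plain Java. It does not call Lucene's NumberTools; the class `SortableNumberSketch`, its methods, and the fixed width are hypothetical, and it handles only non-negative values.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.List;
import java.util.Locale;

// Hypothetical helper, not Lucene's NumberTools: zero-pads non-negative longs
// so that String.compareTo orders them the same way numeric comparison does.
class SortableNumberSketch {
    // 19 digits is enough to hold Long.MAX_VALUE.
    static final int WIDTH = 19;

    static String encode(long value) {
        if (value < 0) {
            throw new IllegalArgumentException("sketch handles non-negative values only");
        }
        return String.format(Locale.ROOT, "%0" + WIDTH + "d", value);
    }

    static long decode(String term) {
        return Long.parseLong(term);
    }

    public static void main(String[] args) {
        List<String> terms = new ArrayList<>();
        for (long v : new long[] {100L, 9L, 25L}) {
            terms.add(encode(v));
        }
        Collections.sort(terms); // plain lexicographic sort on the padded Strings
        for (String t : terms) {
            System.out.println(t + " -> " + decode(t));
        }
    }
}
```

Without the padding, "9" sorts after "100" as a String, which is why a range search over raw number Strings gives surprising results.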


Related article:
http://mail-archives.apache.org/mod_mbox/lucene-java-user/200711.mbox/%3C94F2BB84-A9D8-480F-B85B-7F07E6B3389A@apache.org%3E



"Re: Reply: how to effeciently implement th ..." posted by ~Ray
Posted on 2008-01-01 21:15:37

John Wang wrote:
> Would payload bring home the bacon?
> -John

Yes, if you used payloads instead of stored fields your performance should be much better. Try to index one special term per type (e.g. score:pagerank) and index one position with a payload for each doc. Then, when you retrieve hits, open a TermPositions using the special term, get the payload, and incorporate it in the doc's score. The performance overhead should be comparable to adding one AND-term to your query.

-Michael

> On 11/15/07, Zhou Qi <zhou_qi@sjtu.edu.cn> wrote:
>
>> Thank you. My score is a fixed score derived from the properties of the page, but first we need to adjust the score to get a better result.
>> I have tried manually re-ranking all the documents in the search results. But that needs to iterate over all the retrieved results and fetch the re-ranking score (stored in the index) to compute the overall score. It is inefficient. How can I replace that with a new approach?
>>
>> Sorry for the misunderstanding. Thanks
>>
>> Best Regards,
>>
>> Zhou Qi
>> Dept. Computer Science & Engineering
>> Shanghai Jiaotong University
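A payload is just a byte array attached to a term position, so a per-document score fits in four bytes. The sketch below shows only that encoding step and one possible way to fold the decoded value into a hit's score. It is plain Java and does not call Lucene; `PayloadScoreSketch` and its methods are hypothetical, and multiplying the two scores is an assumption, since the reply only says to "incorporate" the payload.

```java
import java.nio.ByteBuffer;

// Plain-Java sketch; it does not use Lucene's payload API.
class PayloadScoreSketch {
    // Pack a float score into the 4 bytes that would be stored as the payload.
    static byte[] encodeScore(float score) {
        return ByteBuffer.allocate(4).putFloat(score).array();
    }

    // Unpack the float score from the payload bytes.
    static float decodeScore(byte[] payload) {
        return ByteBuffer.wrap(payload).getFloat();
    }

    // Assumption: the static score multiplies the query score.
    // Any other combining rule (sum, weighted sum) would work the same way.
    static float combine(float queryScore, byte[] payload) {
        return queryScore * decodeScore(payload);
    }

    public static void main(String[] args) {
        byte[] payload = encodeScore(0.5f);
        System.out.println(combine(2.0f, payload));
    }
}
```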


Related article:
http://mail-archives.apache.org/mod_mbox/lucene-java-user/200711.mbox/%3C473C9108.2080504@gmail.com%3E



"[jira] Resolved: (LUCENE-1052) Add an "termInfosIndexDivisor" to ..." posted by ~Ray
Posted on 2007-12-15 15:03:31

Michael McCandless resolved LUCENE-1052.
----------------------------------------

    Resolution: Fixed

I just committed this. Thanks Chuck & Doug!

> Add an "termInfosIndexDivisor" to IndexReader
> ---------------------------------------------
>
> Key: LUCENE-1052
> URL:
> Project: Lucene - Java
> Issue Type: Improvement
> Components: Index
> Affects Versions: 2.2
> Reporter: Michael McCandless
> Assignee: Michael McCandless
> Priority: Minor
> Fix For: 2.3
>
> Attachments: LUCENE-1052.patch
>
>
> The termIndexInterval, set during indexing time, lets you trade off
> how much RAM is used by a reader to load the indexed terms vs the cost of
> seeking to the specific term you want to load.
> But the downside is you must set it at indexing time.
> This issue adds an indexDivisor to TermInfosReader so that on opening
> a reader you could further sub-sample the termIndexInterval to use
> less RAM. E.g., a setting of 2 means every (2 * termIndexInterval)-th term is
> loaded into RAM.
> This is particularly useful if your index has a great many terms (e.g.
> you accidentally indexed binary terms).
> Spinoff from this thread:
>
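The trade-off in the issue text can be sketched with simple arithmetic: a larger divisor keeps fewer terms in RAM, but the reader may have to scan further past the nearest loaded term. The code below is an illustration only, not TermInfosReader; the class `TermIndexDivisorSketch`, its method names, and the numbers used in `main` are assumptions.

```java
import java.util.ArrayList;
import java.util.List;

// Illustrative model of index sub-sampling; it does not read a Lucene index.
class TermIndexDivisorSketch {
    // Positions, within the full sorted term list, of the terms held in RAM.
    static List<Integer> loadedPositions(int termCount, int termIndexInterval, int indexDivisor) {
        if (termIndexInterval < 1 || indexDivisor < 1) {
            throw new IllegalArgumentException("interval and divisor must be >= 1");
        }
        int step = termIndexInterval * indexDivisor;
        List<Integer> positions = new ArrayList<>();
        for (int pos = 0; pos < termCount; pos += step) {
            positions.add(pos);
        }
        return positions;
    }

    // Worst-case number of terms to scan after seeking to the nearest loaded term.
    static int maxScan(int termIndexInterval, int indexDivisor) {
        return termIndexInterval * indexDivisor - 1;
    }

    public static void main(String[] args) {
        int terms = 1000;    // example value
        int interval = 128;  // example value
        for (int divisor : new int[] {1, 2, 4}) {
            System.out.println("divisor=" + divisor
                    + " loaded=" + loadedPositions(terms, interval, divisor).size()
                    + " maxScan=" + maxScan(interval, divisor));
        }
    }
}
```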


Related article:
http://www.gossamer-threads.com/lists/lucene/java-dev/54677



"[jira] Resolved: (LUCENE-1050) SimpleFSLockFactory ignores error ..." posted by ~Ray
Posted on 2007-12-09 13:37:50

Michael McCandless resolved LUCENE-1050.
----------------------------------------

    Resolution: Fixed

> SimpleFSLockFactory ignores error on deleting the lock file
> -----------------------------------------------------------
>
> Key: LUCENE-1050
> URL:
> Project: Lucene - Java
> Issue Type: Bug
> Components: Store
> Affects Versions: 2.2
> Reporter: Michael McCandless
> Assignee: Michael McCandless
> Priority: Minor
> Fix For: 2.3
>
> Attachments: LUCENE-1050.patch
>
>
> Spinoff from here:
>
> The Lock release for SimpleFSLockFactory ignores the return value of lockFile.delete(). I plan to throw a new LockReleaseFailedException, subclassing from IOException, when this returns false. This is a very minor change to backwards compatibility because all methods in Lucene that release a lock already throw IOException.
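The change described in the issue can be sketched roughly as follows. This is not the committed patch: only the exception name and its IOException parent come from the issue text, while `SimpleLockSketch`, its methods, and the `exists()` guard (so that releasing an already-removed lock is not treated as a failure) are assumptions.

```java
import java.io.File;
import java.io.IOException;

// Name taken from the issue text; the body is illustrative only.
class LockReleaseFailedException extends IOException {
    LockReleaseFailedException(String message) {
        super(message);
    }
}

// Hypothetical file-based lock, not Lucene's SimpleFSLockFactory.
class SimpleLockSketch {
    private final File lockFile;

    SimpleLockSketch(File lockFile) {
        this.lockFile = lockFile;
    }

    // Returns true only if this call created the lock file.
    boolean obtain() throws IOException {
        return lockFile.createNewFile();
    }

    // Checks the result of delete() instead of ignoring it.
    void release() throws LockReleaseFailedException {
        if (lockFile.exists() && !lockFile.delete()) {
            throw new LockReleaseFailedException("failed to delete " + lockFile);
        }
    }

    public static void main(String[] args) throws IOException {
        File f = File.createTempFile("write", ".lock");
        f.delete(); // start without a lock file
        SimpleLockSketch lock = new SimpleLockSketch(f);
        System.out.println("obtained: " + lock.obtain());
        lock.release();
        System.out.println("still exists: " + f.exists());
    }
}
```

Because the new exception extends IOException, callers that already declare or catch IOException keep compiling unchanged, which is the backwards-compatibility point the issue makes.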


Related article:
http://www.gossamer-threads.com/lists/lucene/java-dev/54674



"Re: help for a nutch beginner" posted by ~Ray
Posted on 2007-11-17 15:33:07

Hi, I can share my times with you. I'm fetching 500,000 pages in each run:

generating 5 hrs
fetching 8 hrs
parsing 2.5 hrs
updating 3.5 hrs

I have about 30 million URLs in the db right now, and those times are for a cluster of 3 machines. So yes, it takes a lot of time. I think that using the native Hadoop libraries could speed it up a bit, but unfortunately I can't get them to work on Debian. I will switch the cluster to Fedora or some other Linux that is supported and test then, but it'll probably be next week. I hope you do know that you will need huge storage for storing segments for 100 million pages. Also, I'd suggest running a nightly build of Nutch, as it has Hadoop 0.15, which is much more stable for me.

--
Karol Rybak
Programista / Programmer
Sekcja aplikacji / Applications section
Wyższa Szkoła Informatyki i Zarządzania / University of Internet Technology and Management
+48(17)8661277


Related article:
http://mail-archives.apache.org/mod_mbox/lucene-nutch-user/200711.mbox/%3C5d2a5a3d0711150219k1c4e732am418e6c750ebc2977@mail.gmail.com%3E



 

 













