Fixing org.apache.lucene.queryParser.ParseException: Encountered "<EOF>"

Author: GUOQIFENG_534 | Source: Internet | 2023-01-05 17:07

Symptom:

org.apache.lucene.queryParser.ParseException: Encountered "<EOF>" at line 1, column 0.
Was expecting one of:
    <NOT> ...
    "+" ...
    "-" ...
    "(" ...
    <QUOTED> ...
    <TERM> ...
    <PREFIXTERM> ...
    <WILDTERM> ...
    "[" ...
    "{" ...
    <NUMBER> ...
    
at org.apache.lucene.queryParser.QueryParser.generateParseException(QueryParser.java:1226)
at org.apache.lucene.queryParser.QueryParser.jj_consume_token(QueryParser.java:1109)
at org.apache.lucene.queryParser.QueryParser.Clause(QueryParser.java:759)
at org.apache.lucene.queryParser.QueryParser.Query(QueryParser.java:684)
at ch2.lucenedemo.process.Test.RunVsIndex(Test.java:142)
at ch2.lucenedemo.process.Test.main(Test.java:169)

Method 1:

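If you see the error above, the wrong method was called. The stack trace shows the test code invoking QueryParser.Query directly. Query is one of the parser's generated grammar methods and takes a field name rather than the query text, so the parser apparently has no input to read and reports <EOF> at column 0. Changing the call from queryParser.Query to queryParser.parse, which accepts the query string, made the error go away.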

Method 2:

1. Question:

I am working on a classification problem to classify product reviews as positive, negative or neutral as per the training data using Lucene API.

I am using an ArrayList of Review objects - "reviewList" that stores the attributes for each review while crawling the web pages.

The review attributes, which include "polarity" and "review content", are then indexed using the indexer. Thereafter, based on the indexed objects, I need to classify the remaining review objects. But while doing so, there is a review object for which the query parser encounters an EOF in the "review content" and hence terminates.

The line causing error has been commented accordingly -

    IndexReader reader = IndexReader.open(FSDirectory.open(new File("index")));
    IndexSearcher searcher = new IndexSearcher(reader);
    Analyzer analyzer = new StandardAnalyzer(Version.LUCENE_31);
    QueryParser parser = new QueryParser(Version.LUCENE_31, "Review", analyzer);

    int length = Crawler.reviewList.size();
    for (int i = 200; i [... text lost in extraction ...] 3.0) {
            true_class = "1";
        } else {
            true_class = "0";
        }

        String[] reviewTokens = Crawler.reviewList.get(i).getReview().split(" ");
        String parsedReview = "";

        int j;

        for (j = 0; j [... rest of the listing lost in extraction ...]


I've parsed the text manually to remove the characters that are causing the error, apart from checking if the next string is null...but the error persists.

This is the error stack trace -

Exception in thread "main" org.apache.lucene.queryParser.ParseException: Cannot parse 'I made the choice ... be all "thumbs ': Lexical error at line 1, column 938.  Encountered: <EOF> after : "\"thumbs "
at org.apache.lucene.queryParser.QueryParser.parse(QueryParser.java:216)
at Sentiment_Analysis.Classification.classify(Classification.java:58)
at Sentiment_Analysis.Main.main(Main.java:17)
Caused by: org.apache.lucene.queryParser.TokenMgrError: Lexical error at line 1, column 938.  Encountered: <EOF> after : "\"thumbs "
at org.apache.lucene.queryParser.QueryParserTokenManager.getNextToken(QueryParserTokenManager.java:1229)
at org.apache.lucene.queryParser.QueryParser.jj_scan_token(QueryParser.java:1709)
at org.apache.lucene.queryParser.QueryParser.jj_3R_2(QueryParser.java:1598)
at org.apache.lucene.queryParser.QueryParser.jj_3_1(QueryParser.java:1605)
at org.apache.lucene.queryParser.QueryParser.jj_2_1(QueryParser.java:1585)
at org.apache.lucene.queryParser.QueryParser.Clause(QueryParser.java:1280)
at org.apache.lucene.queryParser.QueryParser.Query(QueryParser.java:1266)
at org.apache.lucene.queryParser.QueryParser.Clause(QueryParser.java:1313)
at org.apache.lucene.queryParser.QueryParser.Query(QueryParser.java:1266)
at org.apache.lucene.queryParser.QueryParser.TopLevelQuery(QueryParser.java:1226)
at org.apache.lucene.queryParser.QueryParser.parse(QueryParser.java:206)
... 2 more
Java Result: 1

Please help me solve this problem...have been banging my head with this for hours now!

2. Answer:

You should escape the double quote and other special characters via

    Query query = parser.parse(QueryParser.escape(parsedReview));

As the QueryParser.escape Javadoc says:

Returns a String where those characters that QueryParser expects to be escaped are escaped by a preceding '\'.

Summary: pass the text through the static method QueryParser.escape(String s) so that special characters are escaped automatically, then run the keyword query on the escaped string.
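
To make the effect of escaping concrete, here is a minimal plain-Java sketch that puts a backslash in front of each query-syntax character. It is not Lucene's implementation: the class name QueryEscapeSketch is hypothetical, and the character set is an assumption based on the classic Lucene query syntax, which can differ between versions. In real code, call the library's own QueryParser.escape.

```java
// Illustrative sketch only: a hand-written escaper for Lucene-style query syntax.
// Real code should call the library's QueryParser.escape(String) instead.
final class QueryEscapeSketch {

    // Assumed set of special characters (classic Lucene query syntax);
    // the exact set may vary between Lucene versions.
    private static final String SPECIAL_CHARS = "\\+-!():^[]\"{}~*?|&";

    /** Returns a copy of the input with a backslash before every special character. */
    static String escape(String input) {
        StringBuilder escaped = new StringBuilder(input.length() + 8);
        for (int i = 0; i < input.length(); i++) {
            char c = input.charAt(i);
            if (SPECIAL_CHARS.indexOf(c) >= 0) {
                escaped.append('\\');
            }
            escaped.append(c);
        }
        return escaped.toString();
    }

    public static void main(String[] args) {
        // Tail of the review text from the stack trace: an unbalanced double quote.
        String review = "be all \"thumbs ";
        System.out.println(escape(review));
    }
}
```

With the lone double quote escaped, the parser no longer treats it as the start of a phrase that never closes, which is what produced the lexical error at the end of the input. Stripping only "-" and "!" by hand, as the question describes, leaves the quote character in place.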
 
Original sources:
Symptom and Method 1:
https://blog.csdn.net/tengdazhang770960436/article/details/17881671
Method 2:
https://***.com/questions/10259907/lucene-exception-query-parser-encountered-eof-after-some-word




    
        