Efficient substring search in a large text file containing 100 million strings (no duplicate strings) - Java question

Problem description

I have a large text file (1.5 GB) containing 100 million strings (no duplicates), one string per line. I want to build a web application in Java so that when a user enters a keyword (a substring), they get the count of all strings in the file that contain that keyword. I already know one technique, Lucene. Is there any other way to do this? I want the result within 3-4 seconds. My system has 4 GB of RAM and a dual-core CPU. This needs to be done in Java only.
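To make the task concrete, here is a minimal brute-force sketch of the count described above: stream the file line by line and count the lines that contain the keyword. The class name `SubstringCounter`, the method name `countLinesContaining`, the UTF-8 encoding, and the case-sensitive match are all assumptions for illustration; none of them come from the question. A full scan of a 1.5 GB file on every query is limited by disk and page-cache throughput, so this may well miss the 3-4 second target; it is only a baseline to measure any indexed approach against.

```java
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.Arrays;
import java.util.stream.Stream;

class SubstringCounter {

    // Counts the lines of `file` that contain `keyword` as a substring.
    // Assumptions: the file is UTF-8 encoded and matching is case-sensitive.
    static long countLinesContaining(Path file, String keyword) throws IOException {
        // Files.lines reads lazily, so the whole file is never held in memory.
        try (Stream<String> lines = Files.lines(file, StandardCharsets.UTF_8)) {
            // parallel() lets both cores take part in the substring checks.
            return lines.parallel()
                        .filter(line -> line.contains(keyword))
                        .count();
        }
    }

    public static void main(String[] args) throws IOException {
        // Tiny stand-in for the real 100-million-line file.
        Path file = Files.createTempFile("strings", ".txt");
        Files.write(file, Arrays.asList("apple", "pineapple", "banana", "grape"));
        System.out.println(countLinesContaining(file, "apple")); // prints 2
        Files.delete(file);
    }
}
```

In a web application, a method like this would be called once per request with the user's keyword; timing it on the real file shows how far a plain scan is from the target on the given hardware.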

Recommended answer