I m currently working on a Lucene.NET full-text search implementation. For the most part it s going quite well but I m having a few issues revolving around acronyms in the data...
As an example of what s going on if I had "N.A.S.A." in the field I indexed I m able to match it with n.a.s.a. or nasa, but n.a.s.a doesn t match it, not even if I put a fuzzy-search (n.a.s.a~).
The first thought that comes to mind for me is to rip out all the . s before indexing/searching, but it seems a bit more like a workaround than a solution and I was hoping to get a cleaner solution.
Can anyone suggest any changes or a different analyzer (using StandardAnalyzer currently) that may be more suited to matching this kind of data?