How to search for a specific string in a file?

SimpleData · modified on Sunday, March 7, 2010 9:38 AM

It is a great idea. How come I've never thought it. Thanks. :)

SimpleData

Thanks.

Lost User

You're welcome :)

Luc Pattyn · modified on Sunday, March 7, 2010 9:38 AM

You do know it gets a little bit more complex when the characters in the search string aren't all different, as in: find "anas" in "a long text containing ananas and other stuff"; returning to state 0 isn't always right. :)

Luc Pattyn [Forum Guidelines] [Why QA sucks] [My Articles]

I only read code that is properly formatted, adding PRE tags is the easiest way to obtain that.

Lost User

Ok fine, spoil the fun :) So maybe it isn't 0 but more like X.LastIndexOf(char that was read) (or 0 if it wasn't found) - or if it isn't that I might actually have to think and it's the weekend so no thanks (exercise for the reader?) My brain got unlazy for a second and remembered the solution - see the edit..

modified on Saturday, March 6, 2010 9:56 PM

Shane5555

Similar to StarBP: use 2 rolling buffers which are the size of the search terms. initialize by loading data into buffer 1 repeat the following steps: clear buffer 2 dump buffer 1 into buffer 2 load fresh data into buffer 1 combine the buffers search the combination If the buffers are the correct size your search terms will always be in one combination. The overhead is that you will be searching each buffer twice though. hope it helps Shane

SimpleData

Yes, but I think that is not a problem for me. I am looking for a string in the file, no matter where it is. I think code can express everything, in a better way. Here is my code:

private long DigBinary(string file, string strToDig)
{
FileStream fs = null;

        char\[\] chAim = strToDig.ToCharArray();
        char chTemp = '0';
        long latestHitBeginningLocation = 0;
        int locationInArray = 0;

        try { fs = new FileStream(file, FileMode.Open, FileAccess.Read); }
        catch { throw new Exception("An error occured while creating the stream."); }

        try
        {
            while (locationInArray < chAim.Length)
            {
                chTemp = (char)fs.ReadByte();

                if( chTemp != chAim\[locationInArray\] )
                    locationInArray = 0;

                if (chTemp == chAim\[locationInArray\])
                {
                    if (locationInArray == 0)
                        latestHitBeginningLocation = fs.Position - 1;

                    if (locationInArray == chAim.Length)
                        break;

                    locationInArray++;
                }
                else
                {
                    locationInArray = 0;
                    latestHitBeginningLocation = 0;
                }
            }
        }
        catch { throw new Exception("An error occured while reading the file."); }
        finally { if (fs != null) { fs.Close(); fs.Dispose(); } }

        return latestHitBeginningLocation;
    }

And yes, I know that my try-catch is useless. :D

Luc Pattyn

that is a horrible piece of "code". X| X| X|

Luc Pattyn [Forum Guidelines] [Why QA sucks] [My Articles]

I only read code that is properly formatted, adding PRE tags is the easiest way to obtain that.

SimpleData

I am open to suggestions.

Luc Pattyn

Here are some: - unspecified catch = deadly sin - store actual exception as inner exception in functional exception - user-generated exceptions should inherit from ApplicationException - two try blocks where one would suffice - redundant chTemp initialization - should use using statement - char[] chAim = strToDig.ToCharArray(); is redundant; use strToDig[index] And the algorithm is wrong, as I reported earlier. :)

Luc Pattyn [Forum Guidelines] [Why QA sucks] [My Articles]

I only read code that is properly formatted, adding PRE tags is the easiest way to obtain that.

SimpleData

Thanks for the advices. I will change the code accordingly. This algorithm covers my needs. It works, it is fast and it doesn't consume much RAM. :)