Skip to content
  • Categories
  • Recent
  • Tags
  • Popular
  • World
  • Users
  • Groups
Skins
  • Light
  • Cerulean
  • Cosmo
  • Flatly
  • Journal
  • Litera
  • Lumen
  • Lux
  • Materia
  • Minty
  • Morph
  • Pulse
  • Sandstone
  • Simplex
  • Sketchy
  • Spacelab
  • United
  • Yeti
  • Zephyr
  • Dark
  • Cyborg
  • Darkly
  • Quartz
  • Slate
  • Solar
  • Superhero
  • Vapor

  • Default (No Skin)
  • No Skin
Collapse
Code Project
  1. Home
  2. The Lounge
  3. Do you know

Do you know

Scheduled Pinned Locked Moved The Lounge
question
22 Posts 8 Posters 0 Views 1 Watching
  • Oldest to Newest
  • Newest to Oldest
  • Most Votes
Reply
  • Reply as topic
Log in to reply
This topic has been deleted. Only users with topic management privileges can see it.
  • RaviBeeR RaviBee

    "The" ? /ravi

    My new year resolution: 2048 x 1536 Home | Articles | My .NET bits | Freeware ravib(at)ravib(dot)com

    M Offline
    M Offline
    musefan
    wrote on last edited by
    #5

    Surely "the" would be more common?

    I may or may not be responsible for my own actions

    RaviBeeR 1 Reply Last reply
    0
    • RaviBeeR RaviBee

      "The" ? /ravi

      My new year resolution: 2048 x 1536 Home | Articles | My .NET bits | Freeware ravib(at)ravib(dot)com

      U Offline
      U Offline
      Uros Calakovic
      wrote on last edited by
      #6

      Probably, but I was only counting real words.

      The bearing of a child takes nine months, no matter how many women are assigned.

      P 1 Reply Last reply
      0
      • M musefan

        Surely "the" would be more common?

        I may or may not be responsible for my own actions

        RaviBeeR Offline
        RaviBeeR Offline
        RaviBee
        wrote on last edited by
        #7

        Undoubtedly! :) /ravi

        My new year resolution: 2048 x 1536 Home | Articles | My .NET bits | Freeware ravib(at)ravib(dot)com

        1 Reply Last reply
        0
        • U Uros Calakovic

          what is the most commonly used word in Orwell's Nineteen Eighty-Four?

          The bearing of a child takes nine months, no matter how many women are assigned.

          W Offline
          W Offline
          wizardzz
          wrote on last edited by
          #8

          Hmm real words only? No idea, do pronouns count? Is it a verb, adjective, or noun, my guess is noun.

          "Life should not be a journey to the grave with the intention of arriving safely in a pretty and well preserved body, but rather to skid in broadside in a cloud of smoke, thoroughly used up, totally worn out, and loudly proclaiming "Wow! What a Ride!" — Hunter S. Thompson

          U 1 Reply Last reply
          0
          • W wizardzz

            Hmm real words only? No idea, do pronouns count? Is it a verb, adjective, or noun, my guess is noun.

            "Life should not be a journey to the grave with the intention of arriving safely in a pretty and well preserved body, but rather to skid in broadside in a cloud of smoke, thoroughly used up, totally worn out, and loudly proclaiming "Wow! What a Ride!" — Hunter S. Thompson

            U Offline
            U Offline
            Uros Calakovic
            wrote on last edited by
            #9

            It is a proper noun :)

            The bearing of a child takes nine months, no matter how many women are assigned.

            W 1 Reply Last reply
            0
            • U Uros Calakovic

              It is a proper noun :)

              The bearing of a child takes nine months, no matter how many women are assigned.

              W Offline
              W Offline
              wizardzz
              wrote on last edited by
              #10

              Must be Big Brother then? or Julia or London. What a great book, reminds me so much of China it's not even funny.

              "Life should not be a journey to the grave with the intention of arriving safely in a pretty and well preserved body, but rather to skid in broadside in a cloud of smoke, thoroughly used up, totally worn out, and loudly proclaiming "Wow! What a Ride!" — Hunter S. Thompson

              U 1 Reply Last reply
              0
              • W wizardzz

                Must be Big Brother then? or Julia or London. What a great book, reminds me so much of China it's not even funny.

                "Life should not be a journey to the grave with the intention of arriving safely in a pretty and well preserved body, but rather to skid in broadside in a cloud of smoke, thoroughly used up, totally worn out, and loudly proclaiming "Wow! What a Ride!" — Hunter S. Thompson

                U Offline
                U Offline
                Uros Calakovic
                wrote on last edited by
                #11

                Yes, it is Julia, here is the top 10:

                1 Julia 85
                2 meaning 36
                3 Syme 34
                4 society 30
                5 top 30
                6 cell 29
                7 vocabulary 29
                8 question 29
                9 turn 28
                10 speak 26

                It's quite interesting (or maybe my VBA is lousy).

                The bearing of a child takes nine months, no matter how many women are assigned.

                W A 2 Replies Last reply
                0
                • U Uros Calakovic

                  Yes, it is Julia, here is the top 10:

                  1 Julia 85
                  2 meaning 36
                  3 Syme 34
                  4 society 30
                  5 top 30
                  6 cell 29
                  7 vocabulary 29
                  8 question 29
                  9 turn 28
                  10 speak 26

                  It's quite interesting (or maybe my VBA is lousy).

                  The bearing of a child takes nine months, no matter how many women are assigned.

                  W Offline
                  W Offline
                  wizardzz
                  wrote on last edited by
                  #12

                  I know you gave me a bunch of hints but I'm still surprised it was in my top guesses. Now I'm curious how your VBS could be crappy. How do you define a word? Spaces on both side? Or can it be followed by Punctuation, ie " Julia. " and " Julia, " still count right?

                  "Life should not be a journey to the grave with the intention of arriving safely in a pretty and well preserved body, but rather to skid in broadside in a cloud of smoke, thoroughly used up, totally worn out, and loudly proclaiming "Wow! What a Ride!" — Hunter S. Thompson

                  U 1 Reply Last reply
                  0
                  • U Uros Calakovic

                    Yes, it is Julia, here is the top 10:

                    1 Julia 85
                    2 meaning 36
                    3 Syme 34
                    4 society 30
                    5 top 30
                    6 cell 29
                    7 vocabulary 29
                    8 question 29
                    9 turn 28
                    10 speak 26

                    It's quite interesting (or maybe my VBA is lousy).

                    The bearing of a child takes nine months, no matter how many women are assigned.

                    A Offline
                    A Offline
                    AspDotNetDev
                    wrote on last edited by
                    #13

                    You lazy fox!

                    [WikiLeaks Cablegate Cables]

                    U 1 Reply Last reply
                    0
                    • W wizardzz

                      I know you gave me a bunch of hints but I'm still surprised it was in my top guesses. Now I'm curious how your VBS could be crappy. How do you define a word? Spaces on both side? Or can it be followed by Punctuation, ie " Julia. " and " Julia, " still count right?

                      "Life should not be a journey to the grave with the intention of arriving safely in a pretty and well preserved body, but rather to skid in broadside in a cloud of smoke, thoroughly used up, totally worn out, and loudly proclaiming "Wow! What a Ride!" — Hunter S. Thompson

                      U Offline
                      U Offline
                      Uros Calakovic
                      wrote on last edited by
                      #14

                      I just used Document.Words[^] trimming the spaces. It appears that in VBA 'Word' is a broad concept, I got several combinations of punctuation marks.

                      The bearing of a child takes nine months, no matter how many women are assigned.

                      1 Reply Last reply
                      0
                      • A AspDotNetDev

                        You lazy fox!

                        [WikiLeaks Cablegate Cables]

                        U Offline
                        U Offline
                        Uros Calakovic
                        wrote on last edited by
                        #15

                        Actually I was just trying to see how long it takes to enumerate words in a document and got carried away.

                        The bearing of a child takes nine months, no matter how many women are assigned.

                        1 Reply Last reply
                        0
                        • U Uros Calakovic

                          what is the most commonly used word in Orwell's Nineteen Eighty-Four?

                          The bearing of a child takes nine months, no matter how many women are assigned.

                          OriginalGriffO Offline
                          OriginalGriffO Offline
                          OriginalGriff
                          wrote on last edited by
                          #16

                          Um. According to a quick analysis, you are wrong:

                          5796 the
                          3463 of
                          2445 a
                          2309 was
                          2297 to
                          2293 and
                          1686 in
                          1443 that
                          1374 it
                          1338 had
                          1315 he
                          986 his
                          820 not
                          784 you
                          745 with
                          708 The
                          658 as
                          656 be
                          651 He
                          633 were
                          598 is
                          590 at
                          583 on
                          576 for
                          543 It
                          527 Winston

                          "Julia" comes in at 100 times, the same as "away". And "the" is a proper word! ;P

                          Real men don't use instructions. They are only the manufacturers opinion on how to put the thing together. Manfred R. Bihy: "Looks as if OP is learning resistant."

                          "I have no idea what I did, but I'm taking full credit for it." - ThisOldTony
                          "Common sense is so rare these days, it should be classified as a super power" - Random T-shirt

                          U 1 Reply Last reply
                          0
                          • OriginalGriffO OriginalGriff

                            Um. According to a quick analysis, you are wrong:

                            5796 the
                            3463 of
                            2445 a
                            2309 was
                            2297 to
                            2293 and
                            1686 in
                            1443 that
                            1374 it
                            1338 had
                            1315 he
                            986 his
                            820 not
                            784 you
                            745 with
                            708 The
                            658 as
                            656 be
                            651 He
                            633 were
                            598 is
                            590 at
                            583 on
                            576 for
                            543 It
                            527 Winston

                            "Julia" comes in at 100 times, the same as "away". And "the" is a proper word! ;P

                            Real men don't use instructions. They are only the manufacturers opinion on how to put the thing together. Manfred R. Bihy: "Looks as if OP is learning resistant."

                            U Offline
                            U Offline
                            Uros Calakovic
                            wrote on last edited by
                            #17

                            You're right. It turned out it was not the macro, I messed it up while importing to Excel and the text to columns feature swallowed several frequent words, including Winston. Maybe I should have used a different book for this. :doh:

                            The bearing of a child takes nine months, no matter how many women are assigned.

                            OriginalGriffO 1 Reply Last reply
                            0
                            • U Uros Calakovic

                              You're right. It turned out it was not the macro, I messed it up while importing to Excel and the text to columns feature swallowed several frequent words, including Winston. Maybe I should have used a different book for this. :doh:

                              The bearing of a child takes nine months, no matter how many women are assigned.

                              OriginalGriffO Offline
                              OriginalGriffO Offline
                              OriginalGriff
                              wrote on last edited by
                              #18

                              I did a PDF -> text, then knocked up a quick C# program:

                                      string path = @"F:\\Temp\\1984.txt";
                                      string\[\] lines = File.ReadAllLines(path);
                                      Regex pageIndicator = new Regex(@"\\s=\\sPage\\s\\d+\\s=", RegexOptions.Multiline | RegexOptions.CultureInvariant | RegexOptions.Compiled);
                                      Regex findWords = new Regex(@"\\b\\w+\\b", RegexOptions.Multiline | RegexOptions.CultureInvariant | RegexOptions.Compiled);
                                      Dictionary allWords = new Dictionary();
                                      int totalWords = 0;
                                      foreach (string line in lines)
                                          {
                                          if (!pageIndicator.IsMatch(line))
                                              {
                                              if (!string.IsNullOrEmpty(line))
                                                  {
                                                  MatchCollection words = findWords.Matches(line);
                                                  foreach (Match word in words)
                                                      {
                                                      totalWords++;
                                                      if (allWords.ContainsKey(word.Value))
                                                          {
                                                          allWords\[word.Value\]++;
                                                          }
                                                      else
                                                          {
                                                          allWords.Add(word.Value, 1);
                                                          }
                                                      }
                                                  }
                                              }
                                          }
                                       labResults.Text = string.Format("Total: {0} words, with {1} distinct", totalWords, allWords.Count);
                              

                              Real men don't use instructions. They are only the manufacturers opinion on how to put the thing together. Manfred R. Bihy: "Looks as if OP is learning resistant."

                              "I have no idea what I did, but I'm taking full credit for it." - ThisOldTony
                              "Common sense is so rare these days, it should be classified as a super power" - Random T-shirt

                              U 1 Reply Last reply
                              0
                              • OriginalGriffO OriginalGriff

                                I did a PDF -> text, then knocked up a quick C# program:

                                        string path = @"F:\\Temp\\1984.txt";
                                        string\[\] lines = File.ReadAllLines(path);
                                        Regex pageIndicator = new Regex(@"\\s=\\sPage\\s\\d+\\s=", RegexOptions.Multiline | RegexOptions.CultureInvariant | RegexOptions.Compiled);
                                        Regex findWords = new Regex(@"\\b\\w+\\b", RegexOptions.Multiline | RegexOptions.CultureInvariant | RegexOptions.Compiled);
                                        Dictionary allWords = new Dictionary();
                                        int totalWords = 0;
                                        foreach (string line in lines)
                                            {
                                            if (!pageIndicator.IsMatch(line))
                                                {
                                                if (!string.IsNullOrEmpty(line))
                                                    {
                                                    MatchCollection words = findWords.Matches(line);
                                                    foreach (Match word in words)
                                                        {
                                                        totalWords++;
                                                        if (allWords.ContainsKey(word.Value))
                                                            {
                                                            allWords\[word.Value\]++;
                                                            }
                                                        else
                                                            {
                                                            allWords.Add(word.Value, 1);
                                                            }
                                                        }
                                                    }
                                                }
                                            }
                                         labResults.Text = string.Format("Total: {0} words, with {1} distinct", totalWords, allWords.Count);
                                

                                Real men don't use instructions. They are only the manufacturers opinion on how to put the thing together. Manfred R. Bihy: "Looks as if OP is learning resistant."

                                U Offline
                                U Offline
                                Uros Calakovic
                                wrote on last edited by
                                #19

                                Must learn regex!

                                The bearing of a child takes nine months, no matter how many women are assigned.

                                OriginalGriffO 1 Reply Last reply
                                0
                                • U Uros Calakovic

                                  Must learn regex!

                                  The bearing of a child takes nine months, no matter how many women are assigned.

                                  OriginalGriffO Offline
                                  OriginalGriffO Offline
                                  OriginalGriff
                                  wrote on last edited by
                                  #20

                                  Worth it for something like this: I learnt it pretty much from playing with Expresso[^] - it explains, designs and tests regexes. It's free, and I wish I'd written it!

                                  Real men don't use instructions. They are only the manufacturers opinion on how to put the thing together. Manfred R. Bihy: "Looks as if OP is learning resistant."

                                  "I have no idea what I did, but I'm taking full credit for it." - ThisOldTony
                                  "Common sense is so rare these days, it should be classified as a super power" - Random T-shirt

                                  1 Reply Last reply
                                  0
                                  • U Uros Calakovic

                                    Probably, but I was only counting real words.

                                    The bearing of a child takes nine months, no matter how many women are assigned.

                                    P Offline
                                    P Offline
                                    Pete OHanlon
                                    wrote on last edited by
                                    #21

                                    "The" is a real word. Your experiment is flawed.

                                    I'm not a stalker, I just know things. Oh by the way, you're out of milk.

                                    Forgive your enemies - it messes with their heads

                                    My blog | My articles | MoXAML PowerToys | Onyx

                                    U 1 Reply Last reply
                                    0
                                    • P Pete OHanlon

                                      "The" is a real word. Your experiment is flawed.

                                      I'm not a stalker, I just know things. Oh by the way, you're out of milk.

                                      Forgive your enemies - it messes with their heads

                                      My blog | My articles | MoXAML PowerToys | Onyx

                                      U Offline
                                      U Offline
                                      Uros Calakovic
                                      wrote on last edited by
                                      #22

                                      Pete O'Hanlon wrote:

                                      "The" is a real word.

                                      Well, yes it is, but can it really compete with real, manly words like 'vocabulary' or 'society'?

                                      The bearing of a child takes nine months, no matter how many women are assigned.

                                      1 Reply Last reply
                                      0
                                      Reply
                                      • Reply as topic
                                      Log in to reply
                                      • Oldest to Newest
                                      • Newest to Oldest
                                      • Most Votes


                                      • Login

                                      • Don't have an account? Register

                                      • Login or register to search.
                                      • First post
                                        Last post
                                      0
                                      • Categories
                                      • Recent
                                      • Tags
                                      • Popular
                                      • World
                                      • Users
                                      • Groups