Code Project

Do you know

  • Uros Calakovic wrote:

    Yes, it is Julia. Here is the top 10:

    1 Julia 85
    2 meaning 36
    3 Syme 34
    4 society 30
    5 top 30
    6 cell 29
    7 vocabulary 29
    8 question 29
    9 turn 28
    10 speak 26

    It's quite interesting (or maybe my VBA is lousy).

    The bearing of a child takes nine months, no matter how many women are assigned.

    wizardzz wrote (#12):

    I know you gave me a bunch of hints, but I'm still surprised it was among my top guesses. Now I'm curious how your VBA could be crappy. How do you define a word? Spaces on both sides? Or can it be followed by punctuation, i.e. do " Julia. " and " Julia, " still count?

    "Life should not be a journey to the grave with the intention of arriving safely in a pretty and well preserved body, but rather to skid in broadside in a cloud of smoke, thoroughly used up, totally worn out, and loudly proclaiming "Wow! What a Ride!" — Hunter S. Thompson

      AspDotNetDev wrote (#13):

      You lazy fox!

      [WikiLeaks Cablegate Cables]

        Uros Calakovic wrote (#14):

        I just used Document.Words, trimming the spaces. It appears that in VBA a 'Word' is a broad concept: I got several combinations of punctuation marks.
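
For comparison, here is a minimal C# sketch of a stricter word definition, one where " Julia. " and " Julia, " both count as "Julia" and punctuation-only tokens are dropped. It is not the VBA macro from this thread; the class name `WordTokens`, the method `Extract`, and the letters-plus-inner-apostrophes pattern are all assumptions chosen for illustration.

```csharp
using System;
using System.Collections.Generic;
using System.Text.RegularExpressions;

public static class WordTokens
{
    // Assumption: a "word" is a run of letters, optionally joined by inner apostrophes.
    private static readonly Regex WordPattern = new Regex(@"\p{L}+(?:'\p{L}+)*");

    // Returns the words found in the text, in order, without surrounding punctuation.
    public static List<string> Extract(string text)
    {
        var words = new List<string>();
        foreach (Match m in WordPattern.Matches(text))
        {
            words.Add(m.Value);
        }
        return words;
    }

    public static void Main()
    {
        // Hypothetical sample text, not taken from the book.
        foreach (string w in Extract("He saw Julia. \"Julia,\" he said -- ... Julia!"))
        {
            Console.WriteLine(w);
        }
    }
}
```

With a definition like this, runs such as "--" or "..." never become tokens, so they cannot show up in the frequency table.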

          Uros Calakovic wrote (#15):

          Actually I was just trying to see how long it takes to enumerate words in a document and got carried away.

            • Uros Calakovic wrote:

              What is the most commonly used word in Orwell's Nineteen Eighty-Four?

            OriginalGriff wrote (#16):

            Um. According to a quick analysis, you are wrong:

            5796 the
            3463 of
            2445 a
            2309 was
            2297 to
            2293 and
            1686 in
            1443 that
            1374 it
            1338 had
            1315 he
            986 his
            820 not
            784 you
            745 with
            708 The
            658 as
            656 be
            651 He
            633 were
            598 is
            590 at
            583 on
            576 for
            543 It
            527 Winston

            "Julia" comes in at 100 times, the same as "away". And "the" is a proper word! ;P

            Real men don't use instructions. They are only the manufacturers opinion on how to put the thing together. Manfred R. Bihy: "Looks as if OP is learning resistant."

            "I have no idea what I did, but I'm taking full credit for it." - ThisOldTony
            "Common sense is so rare these days, it should be classified as a super power" - Random T-shirt

              Uros Calakovic wrote (#17):

              You're right. It turned out it was not the macro: I messed it up while importing into Excel, and the Text to Columns feature swallowed several frequent words, including "Winston". Maybe I should have used a different book for this. :doh:

                OriginalGriff wrote (#18):

                I did a PDF -> text, then knocked up a quick C# program:

                        // Needs: using System.Collections.Generic; using System.IO; using System.Text.RegularExpressions;
                        string path = @"F:\Temp\1984.txt";
                        string[] lines = File.ReadAllLines(path);
                        // Matches the page-marker lines left over from the PDF -> text conversion
                        Regex pageIndicator = new Regex(@"\s=\sPage\s\d+\s=", RegexOptions.Multiline | RegexOptions.CultureInvariant | RegexOptions.Compiled);
                        // A word is any run of word characters between word boundaries
                        Regex findWords = new Regex(@"\b\w+\b", RegexOptions.Multiline | RegexOptions.CultureInvariant | RegexOptions.Compiled);
                        // Keys are case-sensitive, so "the" and "The" are counted separately
                        Dictionary<string, int> allWords = new Dictionary<string, int>();
                        int totalWords = 0;
                        foreach (string line in lines)
                            {
                            if (!pageIndicator.IsMatch(line))
                                {
                                if (!string.IsNullOrEmpty(line))
                                    {
                                    MatchCollection words = findWords.Matches(line);
                                    foreach (Match word in words)
                                        {
                                        totalWords++;
                                        if (allWords.ContainsKey(word.Value))
                                            {
                                            allWords[word.Value]++;
                                            }
                                        else
                                            {
                                            allWords.Add(word.Value, 1);
                                            }
                                        }
                                    }
                                }
                            }
                        labResults.Text = string.Format("Total: {0} words, with {1} distinct", totalWords, allWords.Count);
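
The listing earlier in the thread shows "the" (5796) and "The" (708) as separate entries, which is what a case-sensitive dictionary produces. Below is a rough sketch of a case-insensitive variant that also returns the most frequent words in descending order. The class `WordFrequency`, the method `TopWords`, and the sample lines are hypothetical names for illustration and are not part of the program above; it keeps the same `\b\w+\b` word pattern.

```csharp
using System;
using System.Collections.Generic;
using System.Linq;
using System.Text.RegularExpressions;

public static class WordFrequency
{
    // Counts words ignoring case and returns the 'count' most frequent ones.
    public static List<KeyValuePair<string, int>> TopWords(IEnumerable<string> lines, int count)
    {
        Regex findWords = new Regex(@"\b\w+\b", RegexOptions.CultureInvariant);
        // OrdinalIgnoreCase makes "the" and "The" share a single key.
        var totals = new Dictionary<string, int>(StringComparer.OrdinalIgnoreCase);
        foreach (string line in lines)
        {
            foreach (Match word in findWords.Matches(line))
            {
                // current stays 0 when the word has not been seen yet.
                totals.TryGetValue(word.Value, out int current);
                totals[word.Value] = current + 1;
            }
        }
        // Highest count first; ties are broken alphabetically for a stable order.
        return totals.OrderByDescending(p => p.Value)
                     .ThenBy(p => p.Key, StringComparer.OrdinalIgnoreCase)
                     .Take(count)
                     .ToList();
    }

    public static void Main()
    {
        // Hypothetical sample input, not text from the book.
        string[] sample = { "The cat saw the dog.", "He said the dog ran." };
        foreach (var pair in TopWords(sample, 3))
        {
            Console.WriteLine("{0} {1}", pair.Value, pair.Key);
        }
    }
}
```

Folding case merges pairs such as "he"/"He" and "it"/"It" as well, so the totals would differ from the case-sensitive numbers posted above.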
                

                  Uros Calakovic wrote (#19):

                  Must learn regex!

                    OriginalGriff wrote (#20):

                    Worth it for something like this: I learnt it pretty much from playing with Expresso - it explains, designs and tests regexes. It's free, and I wish I'd written it!

                    • Uros Calakovic wrote:

                      Probably, but I was only counting real words.

                      Pete OHanlon wrote (#21):

                      "The" is a real word. Your experiment is flawed.

                      I'm not a stalker, I just know things. Oh by the way, you're out of milk.

                      Forgive your enemies - it messes with their heads

                      My blog | My articles | MoXAML PowerToys | Onyx

                        Uros Calakovic wrote (#22):

                        Pete O'Hanlon wrote:

                        "The" is a real word.

                        Well, yes it is, but can it really compete with real, manly words like 'vocabulary' or 'society'?
