Skip to content
  • Categories
  • Recent
  • Tags
  • Popular
  • World
  • Users
  • Groups
Skins
  • Light
  • Cerulean
  • Cosmo
  • Flatly
  • Journal
  • Litera
  • Lumen
  • Lux
  • Materia
  • Minty
  • Morph
  • Pulse
  • Sandstone
  • Simplex
  • Sketchy
  • Spacelab
  • United
  • Yeti
  • Zephyr
  • Dark
  • Cyborg
  • Darkly
  • Quartz
  • Slate
  • Solar
  • Superhero
  • Vapor

  • Default (No Skin)
  • No Skin
Collapse
Code Project
  1. Home
  2. The Lounge
  3. Do you know

Do you know

Scheduled Pinned Locked Moved The Lounge
question
22 Posts 8 Posters 0 Views 1 Watching
  • Oldest to Newest
  • Newest to Oldest
  • Most Votes
Reply
  • Reply as topic
Log in to reply
This topic has been deleted. Only users with topic management privileges can see it.
  • U Uros Calakovic

    Yes, it is Julia, here is the top 10:

    1 Julia 85
    2 meaning 36
    3 Syme 34
    4 society 30
    5 top 30
    6 cell 29
    7 vocabulary 29
    8 question 29
    9 turn 28
    10 speak 26

    It's quite interesting (or maybe my VBA is lousy).

    The bearing of a child takes nine months, no matter how many women are assigned.

    A Offline
    A Offline
    AspDotNetDev
    wrote on last edited by
    #13

    You lazy fox!

    [WikiLeaks Cablegate Cables]

    U 1 Reply Last reply
    0
    • W wizardzz

      I know you gave me a bunch of hints but I'm still surprised it was in my top guesses. Now I'm curious how your VBS could be crappy. How do you define a word? Spaces on both side? Or can it be followed by Punctuation, ie " Julia. " and " Julia, " still count right?

      "Life should not be a journey to the grave with the intention of arriving safely in a pretty and well preserved body, but rather to skid in broadside in a cloud of smoke, thoroughly used up, totally worn out, and loudly proclaiming "Wow! What a Ride!" — Hunter S. Thompson

      U Offline
      U Offline
      Uros Calakovic
      wrote on last edited by
      #14

      I just used Document.Words[^] trimming the spaces. It appears that in VBA 'Word' is a broad concept, I got several combinations of punctuation marks.

      The bearing of a child takes nine months, no matter how many women are assigned.

      1 Reply Last reply
      0
      • A AspDotNetDev

        You lazy fox!

        [WikiLeaks Cablegate Cables]

        U Offline
        U Offline
        Uros Calakovic
        wrote on last edited by
        #15

        Actually I was just trying to see how long it takes to enumerate words in a document and got carried away.

        The bearing of a child takes nine months, no matter how many women are assigned.

        1 Reply Last reply
        0
        • U Uros Calakovic

          what is the most commonly used word in Orwell's Nineteen Eighty-Four?

          The bearing of a child takes nine months, no matter how many women are assigned.

          OriginalGriffO Offline
          OriginalGriffO Offline
          OriginalGriff
          wrote on last edited by
          #16

          Um. According to a quick analysis, you are wrong:

          5796 the
          3463 of
          2445 a
          2309 was
          2297 to
          2293 and
          1686 in
          1443 that
          1374 it
          1338 had
          1315 he
          986 his
          820 not
          784 you
          745 with
          708 The
          658 as
          656 be
          651 He
          633 were
          598 is
          590 at
          583 on
          576 for
          543 It
          527 Winston

          "Julia" comes in at 100 times, the same as "away". And "the" is a proper word! ;P

          Real men don't use instructions. They are only the manufacturers opinion on how to put the thing together. Manfred R. Bihy: "Looks as if OP is learning resistant."

          "I have no idea what I did, but I'm taking full credit for it." - ThisOldTony
          "Common sense is so rare these days, it should be classified as a super power" - Random T-shirt

          U 1 Reply Last reply
          0
          • OriginalGriffO OriginalGriff

            Um. According to a quick analysis, you are wrong:

            5796 the
            3463 of
            2445 a
            2309 was
            2297 to
            2293 and
            1686 in
            1443 that
            1374 it
            1338 had
            1315 he
            986 his
            820 not
            784 you
            745 with
            708 The
            658 as
            656 be
            651 He
            633 were
            598 is
            590 at
            583 on
            576 for
            543 It
            527 Winston

            "Julia" comes in at 100 times, the same as "away". And "the" is a proper word! ;P

            Real men don't use instructions. They are only the manufacturers opinion on how to put the thing together. Manfred R. Bihy: "Looks as if OP is learning resistant."

            U Offline
            U Offline
            Uros Calakovic
            wrote on last edited by
            #17

            You're right. It turned out it was not the macro, I messed it up while importing to Excel and the text to columns feature swallowed several frequent words, including Winston. Maybe I should have used a different book for this. :doh:

            The bearing of a child takes nine months, no matter how many women are assigned.

            OriginalGriffO 1 Reply Last reply
            0
            • U Uros Calakovic

              You're right. It turned out it was not the macro, I messed it up while importing to Excel and the text to columns feature swallowed several frequent words, including Winston. Maybe I should have used a different book for this. :doh:

              The bearing of a child takes nine months, no matter how many women are assigned.

              OriginalGriffO Offline
              OriginalGriffO Offline
              OriginalGriff
              wrote on last edited by
              #18

              I did a PDF -> text, then knocked up a quick C# program:

                      string path = @"F:\\Temp\\1984.txt";
                      string\[\] lines = File.ReadAllLines(path);
                      Regex pageIndicator = new Regex(@"\\s=\\sPage\\s\\d+\\s=", RegexOptions.Multiline | RegexOptions.CultureInvariant | RegexOptions.Compiled);
                      Regex findWords = new Regex(@"\\b\\w+\\b", RegexOptions.Multiline | RegexOptions.CultureInvariant | RegexOptions.Compiled);
                      Dictionary allWords = new Dictionary();
                      int totalWords = 0;
                      foreach (string line in lines)
                          {
                          if (!pageIndicator.IsMatch(line))
                              {
                              if (!string.IsNullOrEmpty(line))
                                  {
                                  MatchCollection words = findWords.Matches(line);
                                  foreach (Match word in words)
                                      {
                                      totalWords++;
                                      if (allWords.ContainsKey(word.Value))
                                          {
                                          allWords\[word.Value\]++;
                                          }
                                      else
                                          {
                                          allWords.Add(word.Value, 1);
                                          }
                                      }
                                  }
                              }
                          }
                       labResults.Text = string.Format("Total: {0} words, with {1} distinct", totalWords, allWords.Count);
              

              Real men don't use instructions. They are only the manufacturers opinion on how to put the thing together. Manfred R. Bihy: "Looks as if OP is learning resistant."

              "I have no idea what I did, but I'm taking full credit for it." - ThisOldTony
              "Common sense is so rare these days, it should be classified as a super power" - Random T-shirt

              U 1 Reply Last reply
              0
              • OriginalGriffO OriginalGriff

                I did a PDF -> text, then knocked up a quick C# program:

                        string path = @"F:\\Temp\\1984.txt";
                        string\[\] lines = File.ReadAllLines(path);
                        Regex pageIndicator = new Regex(@"\\s=\\sPage\\s\\d+\\s=", RegexOptions.Multiline | RegexOptions.CultureInvariant | RegexOptions.Compiled);
                        Regex findWords = new Regex(@"\\b\\w+\\b", RegexOptions.Multiline | RegexOptions.CultureInvariant | RegexOptions.Compiled);
                        Dictionary allWords = new Dictionary();
                        int totalWords = 0;
                        foreach (string line in lines)
                            {
                            if (!pageIndicator.IsMatch(line))
                                {
                                if (!string.IsNullOrEmpty(line))
                                    {
                                    MatchCollection words = findWords.Matches(line);
                                    foreach (Match word in words)
                                        {
                                        totalWords++;
                                        if (allWords.ContainsKey(word.Value))
                                            {
                                            allWords\[word.Value\]++;
                                            }
                                        else
                                            {
                                            allWords.Add(word.Value, 1);
                                            }
                                        }
                                    }
                                }
                            }
                         labResults.Text = string.Format("Total: {0} words, with {1} distinct", totalWords, allWords.Count);
                

                Real men don't use instructions. They are only the manufacturers opinion on how to put the thing together. Manfred R. Bihy: "Looks as if OP is learning resistant."

                U Offline
                U Offline
                Uros Calakovic
                wrote on last edited by
                #19

                Must learn regex!

                The bearing of a child takes nine months, no matter how many women are assigned.

                OriginalGriffO 1 Reply Last reply
                0
                • U Uros Calakovic

                  Must learn regex!

                  The bearing of a child takes nine months, no matter how many women are assigned.

                  OriginalGriffO Offline
                  OriginalGriffO Offline
                  OriginalGriff
                  wrote on last edited by
                  #20

                  Worth it for something like this: I learnt it pretty much from playing with Expresso[^] - it explains, designs and tests regexes. It's free, and I wish I'd written it!

                  Real men don't use instructions. They are only the manufacturers opinion on how to put the thing together. Manfred R. Bihy: "Looks as if OP is learning resistant."

                  "I have no idea what I did, but I'm taking full credit for it." - ThisOldTony
                  "Common sense is so rare these days, it should be classified as a super power" - Random T-shirt

                  1 Reply Last reply
                  0
                  • U Uros Calakovic

                    Probably, but I was only counting real words.

                    The bearing of a child takes nine months, no matter how many women are assigned.

                    P Offline
                    P Offline
                    Pete OHanlon
                    wrote on last edited by
                    #21

                    "The" is a real word. Your experiment is flawed.

                    I'm not a stalker, I just know things. Oh by the way, you're out of milk.

                    Forgive your enemies - it messes with their heads

                    My blog | My articles | MoXAML PowerToys | Onyx

                    U 1 Reply Last reply
                    0
                    • P Pete OHanlon

                      "The" is a real word. Your experiment is flawed.

                      I'm not a stalker, I just know things. Oh by the way, you're out of milk.

                      Forgive your enemies - it messes with their heads

                      My blog | My articles | MoXAML PowerToys | Onyx

                      U Offline
                      U Offline
                      Uros Calakovic
                      wrote on last edited by
                      #22

                      Pete O'Hanlon wrote:

                      "The" is a real word.

                      Well, yes it is, but can it really compete with real, manly words like 'vocabulary' or 'society'?

                      The bearing of a child takes nine months, no matter how many women are assigned.

                      1 Reply Last reply
                      0
                      Reply
                      • Reply as topic
                      Log in to reply
                      • Oldest to Newest
                      • Newest to Oldest
                      • Most Votes


                      • Login

                      • Don't have an account? Register

                      • Login or register to search.
                      • First post
                        Last post
                      0
                      • Categories
                      • Recent
                      • Tags
                      • Popular
                      • World
                      • Users
                      • Groups