The Notebook Review forums were hosted by TechTarget, who shut down them down on January 31, 2022. This static read-only archive was pulled by NBR forum users between January 20 and January 31, 2022, in an effort to make sure that the valuable technical information that had been posted on the forums is preserved. For current discussions, many NBR forum users moved over to NotebookTalk.net after the shutdown.
Problems? See this thread at archive.org.

    Delete similar lines function in Word 2007?

    Discussion in 'Windows OS and Software' started by radopod, Jul 22, 2008.

  1. radopod

    radopod Notebook Evangelist

    Reputations:
    7
    Messages:
    417
    Likes Received:
    0
    Trophy Points:
    30
    I am using Office 2007 on my notebook. I am trying to organise a number of email addresses from various files for an upcoming event. The problem is that many of the email addresses are present a number of times in various files and it is tough to scan the whole document manually as it spans a total of 125 pages! I was wondering if there is a feature in Office 2007 which could eliminate these extra email addresses. I should admit that these addresses are present in separate lines one after the other. Hoping for a speedy reply.
     
  2. CGSUN

    CGSUN Notebook Guru

    Reputations:
    0
    Messages:
    56
    Likes Received:
    0
    Trophy Points:
    15
    How about find and replace in Word. Have you tried that?
    You could export it to a text doc and scan it with an html editor to speed up the process.

    not a spam mailer is it :p
     
  3. gerryf19

    gerryf19 I am the walrus

    Reputations:
    2,275
    Messages:
    3,990
    Likes Received:
    0
    Trophy Points:
    105
    It was not too difficult in Office 2003, but it is SUPER easy in 2007 if the formatting can be made tabular or into something similar...would need to see what you have to explain it easily, but something like

    john smith [email protected]
    jane smith [email protected]
    ....


    Open the files in Word, then save them as a text file.

    Open Excel, go to data tab, import from text, once you have all the emails in one column, select the column and click (on DATA tab) REMOVE DUPLICATES.

    boom, you're done.

    The hard part (and it's not that hard) is getting it into a format that excel can parse eaily--or, after parsing, getting the necessary data into proper columns.

    I did exactly what you need to do with a 6 physician hospital directories (since physicians have hospital privilidges in multiple hospitals) -- took me about two hours (these files were all VERY different so this was what we call "ugly" data, I doubt it will take you that long) to set up the files and 5 seconds to get rid of all the duplicates