Gathering Public Information
The amount of information you can gather about an organization’s business and information systems that is widely available on the Internet is staggering. To see for yourself, the techniques outlined in the following sections can be used to gather information about your own organization.
Social media
Social media sites are the new means for businesses interacting online. Perusing the following sites can provide untold details on any given business and its people:
Facebook    ( www.facebook.com )
 LinkedIn    ( www.linkedin.com )
 Twitter    ( https://twitter.com )
 YouTube    ( www.youtube.com )
As we’ve all witnessed, employees are often very forthcoming about what they do for work, details about their business, and even what they think about their bosses — especially after throwing back a few when their social filter has gone off track! I’ve also found interesting insight based on what ex-employees say about their former employers at Glassdoor ( www.glassdoor.com ).
Web search
Performing a web search or simply browsing your organization’s website can turn up the following information:
- Employee names and contact information
- Important company dates
- Incorporation filings
- SEC filings (for public companies)
- Press releases about physical moves, organizational changes, and new products
- Mergers and acquisitions
- Patents and trademarks
- Presentations, articles, webcasts, or webinars
With Google, you can search the Internet in several ways:
Typing keywords. This kind of search often reveals hundreds and sometimes millions of pages of information — such as files, phone numbers, and addresses — that you never guessed were available.
Performing advanced web searches. Google’s advanced search options can find sites that link back to your company’s website. This type of search often reveals a lot of information about partners, vendors, clients, and other affiliations.
Using    switches    to    dig    deeper    into    a    website.    For    example,    if    you    want    to    find    a certain    word    or    file    on    your    website,    simply    enter    a    line    like    one    of    the    following into    Google:
 site:www.your_domain.com    keyword
 site:www.your_domain.com    filename
Web crawling
Web-crawling utilities, such as HTTrack Website Copier ( www.httrack.com ), can mirror your website by downloading every publicly-accessible file from it, similar to how a web vulnerability scanner crawls the website it’s testing. You can then inspect that copy of the website offline, digging into the following:
- The website layout and configuration
- Directories and files that might not otherwise be obvious or readily accessible
- The HTML and script source code of web pages
- Comment fields
Comment fields often contain useful information such as names and e-mail addresses of the developers and internal IT personnel, server names, software versions, internal IP addressing schemes, and general comments about how the code works.
Websites
The following websites may provide specific information about an organization and its employees:
Government and business websites:
www.hoovers.com     and     http://finance.yahoo.com     give    detailed
 information    about    public    companies.
 www.sec.gov/edgar.shtml     shows    SEC    filings    of    public    companies.
 www.uspto.gov     offers    patent    and    trademark    registrations.
 The    website    for    your    state’s    Secretary    of    State    or    similar    organization    can
 offer    incorporation    and    corporate    officer    information.
Background    checks    and    other    personal    information,    from    websites    such    as:
 LexisNexis.com    ( www.lexisnexis.com )
 ZabaSearch    ( www.zabasearch.com )
