Methodology

Capitol Words are determined by capturing the full text of the House, Senate and Extension of Remarks sections of the Congressional Record for every day, dating back to the second session of the 106th Congress (January 20, 2000), via GPO Access and storing it on Sunlight's LOUIS database. Sunlight then runs a query on LOUIS to calculate the most commonly used words for a given day, with some exceptions, (described in more detail below). Each afternoon, the daily counts for the previous day are added to the Capitol Words database. Then Sunlight runs queries in the Capitol Words database to determine the most commonly used words by lawmaker and state.

The word count calculated by Sunlight does not include the Daily Digest section in the Congressional Record. This section summarizes the daily activities of Congress. The word count also excludes several sets of commonly used words that do not have substantive meaning. This includes words of two letters or less, a list of common congressional procedural words that was determined by Sunlight and a list of commonly used 'stop-words'. (Stop-words are terms that are commonly ignored by search engines and other data indexers to ensure that only valuable content is queried.) 'Capitol Words' stop-word list is based on the stop-word list provided by the text indexing engine, Onix, in its full indexing toolkit. Capitol Words' list of stop-words is dynamic and may be modified, as needed.

(Please note we are in the process updating the data for a small number of lawmakers. They will be clearly identified on their profile page.)

Lawmakers

Heat Map of Vocal States

(last 60 days)
Click on a state below for more information.

Words of the Day

March 08, 2010
Click on a word below for more information.
  1. 100% 80 office
  2. 100% 79 days
  3. 100% 69 health
  4. 100% 68 code
  5. 100% 67 proposed
  6. 100% 66 service
  7. 100% 65 waiting
  8. 100% 65 report
  9. 100% 64 nominated
  10. 100% 63 nomination
  11. 100% 62 fisheries
  12. 100% 59 district
  13. 100% 58 date
  14. 100% 57 nuclear
  15. 100% 57 agency
  16. 100% 56 revenue
  17. 100% 54 public
  18. 100% 52 director
  19. 100% 51 commerce
  20. 100% 50 military