Internet Holes: Network News Transfer Protocol

Copyright(c), 1996, Dr. Frederick B. Cohen

Series Introduction

The Internet is now the world's most popular network and it is full of potential vulnerabilities. In this series of articles, we explore the vulnerabilities of the Internet and what you can do to mitigate them.

An Introduction to Internet Newsgroups

Internet newsgroups provide a way for hundreds of users to post messages received by thousands of recipients every day. There are about 12,000 newsgroups on the Internet, so when you add it all up, that's a lot of information. To get a sense of the overall magnitude of news on the Internet, each day, there are over 100 megabytes of new information sent through the Internet news servers. Of course not all sites get all of the news feeds.

To make newsgroups work, each newsgroup is created based on a vote of the potential membership. Once a newsgroup exists, anyone on the Internet can post news to it. If the forum is unmoderated (as most are), anything that is posted is sent to all of the recipients whenever they next read the news. If the forum is moderated, the moderator provides a list of authorized users who can post to the newsgroup and allows postings to be placed on the news service by forwarding them with an authorization message to the service supplying the news feed.

News is exchanged through the Network News Transfer Protocol ( NNTP), which normally operates on TCP port 119 on computers connected to the Internet. The details of NNTP are covered in (RFC977) (titled Network News Transfer Protocol ) and (RFC1036) (titled Standard for Interchange of USENET Messages ). To quote from the RFC977:

"NNTP specifies a protocol for the distribution, inquiry, retrieval, and posting of news articles using a reliable stream-based transmission of news among the ARPA-Internet community. NNTP is designed so that news articles are stored in a central database allowing a subscriber to select only those items he wishes to read. Indexing, cross-referencing, and expiration of aged messages are also provided."

How Does NNTP Work?

When you read news, you send a request to a news server for a list of newsgroups that have been added since your last reading of the news by using the NEWGROUPS command, get the response, and can sign up for new services as desired. Next, you use the NEWNEWS command on each newsgroup you subscribe to to get a list of new articles for each newsgroup you are subscribed to. Finally, you request individual articles which are returned as requested.

All of these operations are carried out using a Transmission Control Protocol (TCP) channel through the Internet between your host and a news server, typically residing somewhere on the Internet, or perhaps within your organization. This action takes place on TCP port 119, and for those who are used to using telnet, you can connect to your local news server by using the command:

news-server

The protocol works in much the same way as the Simple Mail Transfer Protocol (SMTP) that exchanges mail is used. For example, the following session:

unix-prompt>telnet pubxfer.news.psi.net 119
Trying 38.8.77.2 ...
Connected to pubxfer.news.psi.net.
Escape character is '^]'.
200 server.net InterNetNews NNRP server INN 1.4 20-Mar-93 ready (posting ok).
help
100 Legal commands
  authinfo user Name|pass Password
  article [MessageID|Number]
  body [MessageID|Number]
  date
 ...
  xpat header range|MessageID pat [morepat...]
  xpath xpath MessageID
Report problems to 
.

Insecurities in NNTP

To understand the security implications of NNTP we have to look at the individual protocol elements and the implementation.

Individual Protocol Elements

NNTP commands result in responses. Certain responses are defined for each command, and each response is given a numerical value. Each command sequence is limited in length to 512 characters, and all transactions are in 7-bit ASCII codes. According to designers, most implementations are "8-bit clean".

Upon entry, the server typically displays its version, whether or not it permits posting, and the name of the system it runs on. The major risk here is that by identifying version information and the availability of posting, intelligence is provided to the potential attacker about whether to carry an attack any further.
The help command gives a list of many of the available commands, all of which must be implemented in every NNTP server. Normally, this is no threat since the protocol is openly published and no special information is printed in the response to a help command.
The ihave command indicates that the caller has a particular article (identified by a message number).If the server desires a copy of that article, it will return a response instructing the client to send the entire article. If the server does not want the article (if, for example, the server already has a copy of it), a response indicating that the article is not wanted will be returned.
This can be exploited in several ways. Without much work, you can claim to have articles, cause servers to load your articles rather than the actual articles (that may come later) and disrupt global news service. Since news servers exchange articles with each other, this may be able to cause a rippling effect throughout the Internet news servers. This technique may also be used to force news servers to prematurely throw out old information by consuming large amounts of disk space in numerous voluminous articles.
The last, list, newsgroups, newnews, and next commands have no obvious impacts other than consuming bandwidth in the servicing of the requests.
Post is used to post messages to a newsgroup. Postings can easily be forged by simply providing the proper heading in the messages to the NNTP server.
The Quit command terminates the TCP connection, and has no obvious negative impacts.
The Slave command is used to tell the server that the current client is acting as a slave server rather than a user session. According to the RFC, It may be used to indicate that priority should therefore be given to requests from this client, as it is presumably serving more than one person. It might also be used to determine which connections to close when system load levels are exceeded, perhaps giving preference to slave servers. This can be used to get higher priority than normal in accessing server information, and thus reduce the performance of other clients. By creating several such channels and getting a lot of information on each channel, server performance may be substantially reduced and other clients may be terminated. According to NNTP programmers, "nobody implements the slave command".

Insecurities in the Message Interchange Standard

News postings also follow RFC1036, Several recent attacks have started to exploit one of the RFC1036 capabilities to destroy news postings before delivery. We begin with an example of the format of an article:

Relay-Version: version B 2.10 2/13/83; site from.net
Posting-Version: version B 2.10 2/13/83; site intermediary.net
Path: intermediary!mhuxj!mhuxt!eagle!jerry
From: jerry@from.net (Jerry Jones)
Newsgroups: net.general, net.security, other.newsgroup
Subject: the header of a legitimate mail message
Message-ID: <642@from.net>
Date: Friday, 19-Nov-95 16:14:55 EST
Followup-To: net.strage.facts
Expires: Saturday, 1-Jan-99 00:00:00 EST
Date-Received: Friday, 19-Nov-95 16:59:30 EST
Organization: Special Operations Organization

     The body of the article comes here, after a blank line.

It is important to note that the contents of these headers can be specified by the person creating the message and can contain arbitrary, false, or misleading information. For example, it is easy to forge a message as if it had come from another user at another site, just as in the SMTP protocol. It is possible to introduce characters that might be misinterpreted by the delivery mechanisms of some NNTP forwarders. These are the same sort of attacks that have worked in various forms against sendmail for many years.

Any unrecognized headers are allowed, and will be passed through unchanged. The required headers are Relay-Version, Posting-Version, From, Date, Newsgroups, Subject, Message-ID, Path. The optional headers are Followup-To, Date-Received, Expires, Reply-To, Sender, References, Control, Distribution, Organization. For example, high priority may be gained by specifying a header interpreted by most news readers. A return-receipt-requested header (or some such thing) may be specified to cause some mailers to automatically return the addresses of those who have received the message, thus generating mailing lists of people interested in various topics without their knowledge or permission.

The message identity is used to track messages throughout the news process. In order to conform to (RFC822) , the Message-ID must have the format:

"<" "unique" "@" "full domain name" ">"

This can be trivially forged, and perhaps even more importantly, if the "unique" entry is not unique, some interesting consequences can result. For example, you might get two different versions of the same message distributed throughout the network - one with the word NOT strategically placed in the message stream. Some people will get the plain message, and others will get the NOT message. The resulting confusion may be worth watching if it's on a controversial subject in an active news group. RFC1036 also warns that:

"Programmers are urged not to make assumptions about the content of message ID fields from other hosts, but to treat them as unknown character strings. It is not safe, for example, to assume that a message ID will be under 14 characters, that it is unique in the first 14 characters, nor that is does not contain a "/"."

Long names have been used in many cases to overflow internal buffers and cause programs to execute malicious code as a result. As an aside, the angle brackets are considered part of the message ID.

The Path line can be used for interesting purposes depending on how the implementations operate. Again, according to RFC1036:

"There are several uses for this information. One is to monitor USENET routing for performance reasons. Another is to establish a path to reach new hosts. Perhaps the most important use is to cut down on redundant USENET traffic by failing to forward a message to a host that is known to have already received it."

By creating a path list containing sites I don't want to get a posting, I can prevent them from receiving the news! Since much of the news goes through a select set of sites, I can effectively limit a news posting to go only outside the United States (for example). Even more interestingly, I may be able to convince the news system to route news from select sites through my site by inserting my site in the path between them and the major servers. This could eventually cause me to control their news feeds, selecting which articles to send them, adding articles strategically, altering content, etc.

According to designers, even though the RFC specifies this routing method, nobody uses it. All routing is done manually. Putting on my auditor's hat, this means that the specification and implementation don't agree. Ouch! This attack may not work because the specification isn't followed.

Other interesting headers include Reply-To, Sender, Followup-To, Date-Received, Expires, References, Distribution, Organization, and of course Control which is used to send control messages between news servers. We'll skip over all but the Control messages to save space, but don't assume that just because we skipped over them they are all safe from exploitation.

From RFC1036: The body of the Control header is the control message. Messages are a sequence of zero or more words, separated by white space (blanks or tabs). The first word is the name of the control message, remaining words are parameters to the message. The remainder of the header and the body of the message are also potential parameters; for example, the "From" line might suggest an address to which a response is to be mailed. ... Implementors and administrators may choose to allow control messages to be automatically carried out, or to queue them for manual processing. However, manually processed messages should be dealt with promptly.

cancel message-ID is the control message used to cancel a previously sent message. Note the obvious danger! This mechanism allows a user to cancel an article after the article has been distributed over the network. To quote again:
In order to forge the cancellation of a news posting, all you have to do is copy the Sender line and create an address forgery. The concept of a verified sender isn't backed up by any protection mechanism. This has been used lately to systematically cancel news postings on forums throughout the Internet.
ihave/sendme is the control message protocol described earlier for updating servers with versions of articles.
newgroup is used to create new newsgroups. To quote the RFC: "Since no articles may be posted or forwarded until a newsgroup is created, this message is required before a newsgroup can be used. The body of the message is expected to be a short paragraph describing the intended use of the newsgroup." Although a voting scheme is purported to be in place before newsgroups are created, a proper forgery of this message may bypass this mechanism in some servers.
rmgroup Again quoting: "This message removes a newsgroup with the given name. Since the newsgroup is removed from every site on the network, this command should be used carefully by a responsible administrator." That should tell you what an irresponsible administrator could do.
Sendsys generates a listing of all neighbors and which newsgroups are sent to each neighbor and mails the listing to the author of the control message. To quote the RFC:
In other words, news service implies the ability to map the news space. Consider the implication on firewalls and how news services can work across firewalls, and you may find a potential conflict. The most obvious problem with such a service is that it may allow outsiders to generate network maps which can be exploited for nefarious purposes.
senduuname lists all uucp neighbors of the local site. Quoting from the RFC: This information is used to make maps of the UUCP network.The sys file is not the same as the UUCP L.sys file. The L.sys file should never be transmitted to another party without the consent of the sites whose passwords are listed therein.
If we can break into one NNTP server, we should be able to extend the attack without limitation to all neighbors recursively if this information is provided.

Implementation Issues

We don't know for certain about any implementation flaws in the programs that implement NNTP, (a letter to the editor revealed one) but if sendmail is any indicator, it is likely that there are. Here are some of the places to look for potential problems.

As in many programs, the daemons used to support NNTP may allow input data to go beyond the ends of the input buffers. This has caused numerous errors in almost every network daemon not specifically designed to be secure. If this can be done to NNTP daemons, the entire news network may be compromised. According to designers, the programs in widespread use have all been checked for buffer overruns to prevent this attack.
Most systems permit only 1024 TCP channels to any given port. It should be pretty easy to create 1024 TCP channels to port 119 on an NNTP server and thus prevent any further communications via NNTP. This could be used to slow or stop global news services.
Errors similar to those that cause many sendmail holes could be active in the NNTP servers commonly used on the Internet. This might allow an attacker to gain root access on NNTP servers. Again according the the designers: "All client connections are marked close-on-exec. It is impossible for client's session to get access to anything other than innd. Innd doesn't run as root; a wrapper opens the port, setuid's, and calls innd."
For NNTP servers that use the ident daemon to determine what user is accessing the NNTP server, the same ident bug problems that worked against sendmail might also work against NNTP servers.
According to designers, IP spoofing can work against NNTP servers however, current protection techniques catch about 9 out of every 10 attempts. There is also a simplistic clear-text password used by some sites that may prevent other spoofing attempts.
Finally, because news spreads through the Internet like a virus, an NNTP server which produces many different codes for the same message could cause global NNTP message expansion, disrupting services to thousands of news servers and consuming massive quantities of disk space worldwide.

Other Abuses of NNTP

There are some other ways that Internet news is abused related purely to what it is - a non-edited global news feed.

Spamming is the name given to sending advertising to newsgroups. These unwanted, off-topic, and very glitzy transmissions waste thousands of peoples' time, bandwidth, and patience every day. Nobody likes being spammed, but that's the nature of unedited forums as they exist today.
False or misleading news is often generated by people for reasons of ignorance, publicity or, in some cases, just to cause experts to believe something that isn't true. Clearly, an unedited forum cannot be believed on its face.
Widespread copyright violation occurs on Internet news. For example, many people republish news stories from commercial news services without paying copyright fees. It is also common for extracts from copyrighted programs to be published to newsgroups.

What Can We Do About It?

The fundamental challenge we face in network news is the integrity challenge. As a secondary issue, availability is a concern, however, since Internet news is not a critical system for many people, it is less of an issue. There is no privacy issue in Internet news since, by definition, it consists entirely of open forums with potentially unlimited distribution.

As a result of the widespread spamming and malicious use of the cancel capability, many people are now discussing ways to add authentication to network news, but there is no realistic solution on the horizon. For example, the new Internet protocols that are about to come out of the Internet Task Force don't provide strong authentication as a vital component of the protocols.

To support authentication in the Internet, we would need to implement a substantial infrastructure improvement in the form of a set of global key servers that could be used to associate cryptographic keys with individuals at sites. We would then have to migrate the entire Internet software suite toward the new standard, which would involve replacing or upgrading tens of millions of copies of about 100 different software components. The resulting overhead in terms of administration and bandwidth at this point in time would be excessive. Because of the nature of the Internet, it is unlikely that such services will become a predominant force in the next several years.

One thing we can all do is to protect ourselves and each other by moving increasingly toward moderated news forums and strong authentication between authorized news servers. The addition of integrity between major news feeds and editorial control over content is vital to having a good news service, whether it be Internet-based or otherwise. One good example of preventing forgeries by strong authentication is the use of PGP-based signatures on postings.

On the other side of the coin, moderators have human failings. We often find that the moderator is donating a lot of personal time and feels they have to get something back from their effort. Some want to advertise in their forums. Others want to limit expression of ideas. In the United States, a recent ruling against a major Internet service provider showed that any moderation of any sort leads to liability for all postings. Clearly, this is a legal blockade to moderation in the Internet.

Summary

Internet news is full of potentials for abuse. Some of these potentials are now being realized, while others are looming on the horizon. Because news is generally viewed as less critical than other systems, some may choose to abandon it, but for many, newsfeeds provide regular updates on fields of interest and act as a major source of information.

The solution to the NNTP problems, just as many of today's information protection challenges, is to address the integrity and availability issues head on.

A Final Comment

Some days, you just can't help getting good examples in the mail. This morning, as I finished writing this article, I got a piece of electronic mail from the Internet. It was from someone I don't know, to someone I don't know, and included a price quote of some sort. None of; my IP address, my site name, my user ID, or the user ID of anyone at any site I know of; was contained in the mail. I forwarded it to the supposed sender and recipient with a request to figure out how it got to me. If I can read other peoples' email by accident, imagine what I could do if I tried.

About The Author

Special Thanks

Special thanks go to Rich Salz who reviewed this manuscript for factual accuracy and helped make many improvements.