10.4. Multiple Connections to the Internet
The questions summarized in this section should rightly be entered into the FAQ, since they are FAQs on the LARTC list.
There are many places where a linux based router/masquerading device can assist in managing multiple Internet connections. We'll outline here some of the more common setups involving multiple Internet connections and how to manage them with iptables, ipchains, and iproute2. One of the first distinctions you can make when planning how to use multiple Internet connections is what inbound services you expect to host and how you want to split traffic over the multiple links.
In the discussion and examples below, I'll address the issues involved with two separate uplinks to two different providers. I assume the following:
- You are not using BGP, and you do not have your own AS. If you are using BGP and have your own AS, you have a different set of problems than the problems described here .
- You have two netblocks from two different ISPs.
- You are funneling your internal network through this routing device, which is performing masquerading/NAT to the Internet.
Additionally, I'll restrict my comments to statically assigned public IP address ranges unless I mention (in particular) dynamically allocated addresses.
In the following sections we'll look at the use of multiple Internet connections first in terms of outbound traffic only, then in terms of inbound traffic only. After that, we'll look at using multiple Internet connections for handling both inbound and outbound services.
10.4.1. Outbound traffic Using Multiple Connections to the Internet
There are two main uses for multiple Internet links connected to the same internal network. One common use is to select an outbound link based on the type of outbound service. The other is to split traffic arbitrarily across multiple ISPs for reasons like failover and to accommodate greater aggregate bandwidth than would be available on a single uplink.
If your need is the latter, please consult the documentation on the LARTC site, as it does a good job of summarizing the issues involved and describes how to accomplish this. This type of use of multiple Internet connections means that (from the perspective of the linux routing device), there is a multipath default route. The LARTC documentation remarks that Julian Anastasov's patches "make things nicer to work with." The patches to which the LARTC documents are referring are Julian's dead gateway detection patches (at least) which can help the linux routing device provide Internet service to the internal network when one of the links is down. See here for Julian's route work.
In the remainder of this section, we'll discuss how to classify traffic for different ISPs, how to handle the packet filtering for this sort of classification scheme, and how to create routing tables appropriate for the task at hand. If anything at all seems unclear in this section, you may find a quick re-reading of the advanced routing overview quite fruitful.
The simplest way to split Internet access into two separate groups is by source IP of the outbound packet. This can be done most simply with ip rule and a second routing table. We'll assume that
masq-gw in the example network gets a second, low cost network connection through a DSL vendor.
The DSL IP on
masq-gw will be 126.96.36.199 with a gateway of 188.8.131.52. We'll assume that this is for outbound connectivity only, and that the IP is active on eth4 of the
masq-gw machine. Before beginning let's outline the process we are going to follow.
- Copy the main routing table to another routing table and set the alternate default route .
- Use iptables/ipchains to mark traffic with fwmark.
- Add a rule to the routing policy database.
Here's a short snippet of shell which you may find handy for copying one routing table to another; see the full script for a more generalized example.
Example 10.1. Multiple Outbound Internet links, part I; ip route
Now, exactly what have we just done? We have created two routing tables on
masq-gw each of which has a different default gateway. We have successfully accomplished the first part of our preparations.
Now, let's mark the traffic we would like to route in using conditional logic. We'll use iptables to select traffic bound for destination ports 80 and 443 originating in the main office desktop network.
Example 10.2. Multiple Outbound Internet links, part II; iptables
With these iptables lines we have instructed netfilter to mark packets matching these criteria with the fwmark and we have prepared the NAT rules so that our outbound packets will originate from the correct IPs.
Once again, it is important to realize that the fwmark added to a packet is only valid and discernible while the packet is still on the host running the packet filter. The fwmark is stored in a data structure the kernel uses to track the packet. Because the fwmark is not a part of the packet itself, the fwmark is lost as soon as the packet has left the local machine. For more detail on the use of fwmark, see Section 10.3.2, "Using fwmark for Policy Routing".
iproute2 supports the use of fwmark as a selector for rule lookups, so we can use fwmarks in the routing policy database to cause packets to be conditionally routed based on that fwmark. This can lead to great complexity if a machine has multiple routing tables, packet filters, and other fancy networking tools, such as NAT or proxies. Caveat emptor.
A convention I find sensible is to use the same number for a routing table and fwmark where possible. This simplifies the maintenance of the systems which are using iproute2 and fwmark, especially if the table identifier and fwmark are set in a configuration file with the same variable name. Since we are testing this on the command line, we'll just make sure that we can add the rules first.
Example 10.3. Multiple Outbound Internet links, part III; ip rule
The last piece is in place. Now, users in the 192.168.99.0/24 subnet who are browsing the Internet should be using the DSL line instead of the T1 line for connectivity.
In order to verify that traffic is indeed getting marked and routed appropriately, you should use tcpdump to profile the outbound traffic on each link at the same time as you generate outbound traffic on both links.
The above is a cookbook example of categorizing traffic, and sending the traffic out across different providers. To my knowledge, the commonest reason to use this sort of solution is to separate traffic by importance and use a reliable (and perhaps more costly) link for the more important traffic while reserving the less costly Internet connection for other connections. In the above illustrative case, we have simply selected the web traffic for the less reliable (DSL) provider.
Once again, if you would like to split load over multiple links regardless of classification of traffic, then you really want a multipath default route, which is described and documented very well in the LARTC HOWTO.
10.4.2. Inbound traffic Using Multiple Connections to the Internet
There are many different ways to handle hosting servers to multiple ISPs, and most of them are out of the scope of this document. If you are in need of this sort of advanced networking, you probably already know where to research. If not, I'd suggest starting your research in load balancing, global load balancing, failover, and layer 4-7 switching. These are networking tools which can facilitate the management of a highly available service.
Publishing the same service on two different ISPs is can be formidable challenge. While this is possible using some of the advanced networking features under linux, one should understand the greater issues involved with publishing a service on two public IPs, especially if the idea is to provide service to the general Internet even if one of the ISPs go down. For a thorough examination of the topics involved with load balancing of all kinds, see Chandra Kopparapu's book Load Balancing Servers, Firewalls and Caches.
If you are aware of the many difficult issues involved in handling inbound connections to a network, and still want to publish a service on two different ISPs (perhaps before you have a more robust load balancing/upper layer switching technology in place), you'll find the recipe below.
Before we examine the recipe, let's look at a complex scenario to see what the crucial points are. If you do not have the kernel packet traveling diagram memorized, you may wish to refer to it in the following discussion. One other item to remember is that routing decisions are stateless .
We'll assume that the client IP is a fixed IP (184.108.40.206) and we'll discuss how this client IP would reach each of the services published on
masq-gw's two public networks. The IPs used for the services will be 220.127.116.11 and 18.104.22.168. Now, whether you are using NAT with iproute2 or with iptables, you'll run across the problem here outlined. Here is the flow of the packet through
masq-gw to the server and back to the client.
Inbound NAT to the same server via two public IPs in two different networks
- inbound packet from 22.214.171.124 to 126.96.36.199 arrives on eth4
- packet is accepted, rewritten, and routed; from 188.8.131.52 to 192.168.100.17; if iptables DNAT, packet is rewritten in PREROUTING chain of nat table, then routed; if iproute2, packet is routed and rewritten simultaneously
- rewritten packet is transmitted out eth0
isoldereceives packet, accepts, responds
- inbound packet from 192.168.100.17 to 184.108.40.206
- routing decision is made; default route (via 220.127.116.11) is selected; if iproute2 is used, packet is also rewritten from 18.104.22.168 to 22.214.171.124
- if iptables DNAT is used, connection tracking will take care of rewriting this packet from 126.96.36.199 to 188.8.131.52
- packet is transmitted out eth1
This is the problem! The packet may have the correct source address, but it is leaving via the wrong interface. Many ISPs filter traffic entering their network and will block traffic from your network with source IPs outside your allocated range. To an ISP this looks like spoofed traffic.
The solution is marvelously elegant and simple. Select one IP on the internal server which will be reachable via one provider and one IP which will be reachable via the other provider. By using two IP addresses on the internal machine, we can use ip rule on
masq-gw to select a routing table with a different default route based upon the source IP of the response packets to clients. Below, we'll assume the same routing tables as in the previous section (cf. Section 10.4.1, "Outbound traffic Using Multiple Connections to the Internet").
Here we have a server
isolde which needs to be accessible via two different public IP addresses. We'll add an IP address to
isolde so that it is reachable on 192.168.100.10 as well as 192.168.100.17. Then, the following rules on
masq-gw will ensure that packets are rewritten and routed in order to avoid the problem pointed out above.
Example 10.4. Multiple Internet links, inbound traffic; using iproute2 only 
10.4.3. Using Multiple Connections to the Internet for Inbound and Outbound Connections
 Anybody who has any experience with linux as a firewall behind a BGP device? Linux as a firewall/router running BGP? Thoughts? Things I should include here? Yeah, I know about Zebra, but I haven't ever used it.
 Sometimes it may not be quite proper to simply copy the main routing table to another routing table. You may want a subset of hosts on the internal network to access the alternate link. Anybody have any sage advice here for the newbie in multiple routing tables?
 This example makes no reference to packet filtering. If you are reading this, I assume you are competent at determining the packet filtering issues. If you have doubts about what rules to add, see Section 5.4, "Stateless NAT and Packet Filtering".