| This document describes how to use the kernel's L2TP drivers to |
| provide L2TP functionality. L2TP is a protocol that tunnels one or |
| more sessions over an IP tunnel. It is commonly used for VPNs |
| (L2TP/IPSec) and by ISPs to tunnel subscriber PPP sessions over an IP |
| network infrastructure. With L2TPv3, it is also useful as a Layer-2 |
| tunneling infrastructure. |
| |
| Features |
| ======== |
| |
| L2TPv2 (PPP over L2TP (UDP tunnels)). |
| L2TPv3 ethernet pseudowires. |
| L2TPv3 PPP pseudowires. |
| L2TPv3 IP encapsulation. |
| Netlink sockets for L2TPv3 configuration management. |
| |
| History |
| ======= |
| |
| The original pppol2tp driver was introduced in 2.6.23 and provided |
| L2TPv2 functionality (rfc2661). L2TPv2 is used to tunnel one or more PPP |
| sessions over a UDP tunnel. |
| |
| L2TPv3 (rfc3931) changes the protocol to allow different frame types |
| to be passed over an L2TP tunnel by moving the PPP-specific parts of |
| the protocol out of the core L2TP packet headers. Each frame type is |
| known as a pseudowire type. Ethernet, PPP, HDLC, Frame Relay and ATM |
| pseudowires for L2TP are defined in separate RFC standards. Another |
| change for L2TPv3 is that it can be carried directly over IP with no |
| UDP header (UDP is optional). It is also possible to create static |
| unmanaged L2TPv3 tunnels manually without a control protocol |
| (userspace daemon) to manage them. |
| |
| To support L2TPv3, the original pppol2tp driver was split up to |
| separate the L2TP and PPP functionality. Existing L2TPv2 userspace |
| apps should be unaffected as the original pppol2tp sockets API is |
| retained. L2TPv3, however, uses netlink to manage L2TPv3 tunnels and |
| sessions. |
| |
| Design |
| ====== |
| |
| The L2TP protocol separates control and data frames. The L2TP kernel |
| drivers handle only L2TP data frames; control frames are always |
| handled by userspace. L2TP control frames carry messages between L2TP |
| clients/servers and are used to setup / teardown tunnels and |
| sessions. An L2TP client or server is implemented in userspace. |
| |
| Each L2TP tunnel is implemented using a UDP or L2TPIP socket; L2TPIP |
| provides L2TPv3 IP encapsulation (no UDP) and is implemented using a |
| new l2tpip socket family. The tunnel socket is typically created by |
| userspace, though for unmanaged L2TPv3 tunnels, the socket can also be |
| created by the kernel. Each L2TP session (pseudowire) gets a network |
| interface instance. In the case of PPP, these interfaces are created |
| indirectly by pppd using a pppol2tp socket. In the case of ethernet, |
| the netdevice is created upon a netlink request to create an L2TPv3 |
| ethernet pseudowire. |
| |
| For PPP, the PPPoL2TP driver, net/l2tp/l2tp_ppp.c, provides a |
| mechanism by which PPP frames carried through an L2TP session are |
| passed through the kernel's PPP subsystem. The standard PPP daemon, |
| pppd, handles all PPP interaction with the peer. PPP network |
| interfaces are created for each local PPP endpoint. The kernel's PPP |
| subsystem arranges for PPP control frames to be delivered to pppd, |
| while data frames are forwarded as usual. |
| |
| For ethernet, the L2TPETH driver, net/l2tp/l2tp_eth.c, implements a |
| netdevice driver, managing virtual ethernet devices, one per |
| pseudowire. These interfaces can be managed using standard Linux tools |
| such as "ip" and "ifconfig". If only IP frames are passed over the |
| tunnel, the interface can be given an IP addresses of itself and its |
| peer. If non-IP frames are to be passed over the tunnel, the interface |
| can be added to a bridge using brctl. All L2TP datapath protocol |
| functions are handled by the L2TP core driver. |
| |
| Each tunnel and session within a tunnel is assigned a unique tunnel_id |
| and session_id. These ids are carried in the L2TP header of every |
| control and data packet. (Actually, in L2TPv3, the tunnel_id isn't |
| present in data frames - it is inferred from the IP connection on |
| which the packet was received.) The L2TP driver uses the ids to lookup |
| internal tunnel and/or session contexts to determine how to handle the |
| packet. Zero tunnel / session ids are treated specially - zero ids are |
| never assigned to tunnels or sessions in the network. In the driver, |
| the tunnel context keeps a reference to the tunnel UDP or L2TPIP |
| socket. The session context holds data that lets the driver interface |
| to the kernel's network frame type subsystems, i.e. PPP, ethernet. |
| |
| Userspace Programming |
| ===================== |
| |
| For L2TPv2, there are a number of requirements on the userspace L2TP |
| daemon in order to use the pppol2tp driver. |
| |
| 1. Use a UDP socket per tunnel. |
| |
| 2. Create a single PPPoL2TP socket per tunnel bound to a special null |
| session id. This is used only for communicating with the driver but |
| must remain open while the tunnel is active. Opening this tunnel |
| management socket causes the driver to mark the tunnel socket as an |
| L2TP UDP encapsulation socket and flags it for use by the |
| referenced tunnel id. This hooks up the UDP receive path via |
| udp_encap_rcv() in net/ipv4/udp.c. PPP data frames are never passed |
| in this special PPPoX socket. |
| |
| 3. Create a PPPoL2TP socket per L2TP session. This is typically done |
| by starting pppd with the pppol2tp plugin and appropriate |
| arguments. A PPPoL2TP tunnel management socket (Step 2) must be |
| created before the first PPPoL2TP session socket is created. |
| |
| When creating PPPoL2TP sockets, the application provides information |
| to the driver about the socket in a socket connect() call. Source and |
| destination tunnel and session ids are provided, as well as the file |
| descriptor of a UDP socket. See struct pppol2tp_addr in |
| include/linux/if_pppol2tp.h. Note that zero tunnel / session ids are |
| treated specially. When creating the per-tunnel PPPoL2TP management |
| socket in Step 2 above, zero source and destination session ids are |
| specified, which tells the driver to prepare the supplied UDP file |
| descriptor for use as an L2TP tunnel socket. |
| |
| Userspace may control behavior of the tunnel or session using |
| setsockopt and ioctl on the PPPoX socket. The following socket |
| options are supported:- |
| |
| DEBUG - bitmask of debug message categories. See below. |
| SENDSEQ - 0 => don't send packets with sequence numbers |
| 1 => send packets with sequence numbers |
| RECVSEQ - 0 => receive packet sequence numbers are optional |
| 1 => drop receive packets without sequence numbers |
| LNSMODE - 0 => act as LAC. |
| 1 => act as LNS. |
| REORDERTO - reorder timeout (in millisecs). If 0, don't try to reorder. |
| |
| Only the DEBUG option is supported by the special tunnel management |
| PPPoX socket. |
| |
| In addition to the standard PPP ioctls, a PPPIOCGL2TPSTATS is provided |
| to retrieve tunnel and session statistics from the kernel using the |
| PPPoX socket of the appropriate tunnel or session. |
| |
| For L2TPv3, userspace must use the netlink API defined in |
| include/linux/l2tp.h to manage tunnel and session contexts. The |
| general procedure to create a new L2TP tunnel with one session is:- |
| |
| 1. Open a GENL socket using L2TP_GENL_NAME for configuring the kernel |
| using netlink. |
| |
| 2. Create a UDP or L2TPIP socket for the tunnel. |
| |
| 3. Create a new L2TP tunnel using a L2TP_CMD_TUNNEL_CREATE |
| request. Set attributes according to desired tunnel parameters, |
| referencing the UDP or L2TPIP socket created in the previous step. |
| |
| 4. Create a new L2TP session in the tunnel using a |
| L2TP_CMD_SESSION_CREATE request. |
| |
| The tunnel and all of its sessions are closed when the tunnel socket |
| is closed. The netlink API may also be used to delete sessions and |
| tunnels. Configuration and status info may be set or read using netlink. |
| |
| The L2TP driver also supports static (unmanaged) L2TPv3 tunnels. These |
| are where there is no L2TP control message exchange with the peer to |
| setup the tunnel; the tunnel is configured manually at each end of the |
| tunnel. There is no need for an L2TP userspace application in this |
| case -- the tunnel socket is created by the kernel and configured |
| using parameters sent in the L2TP_CMD_TUNNEL_CREATE netlink |
| request. The "ip" utility of iproute2 has commands for managing static |
| L2TPv3 tunnels; do "ip l2tp help" for more information. |
| |
| Debugging |
| ========= |
| |
| The driver supports a flexible debug scheme where kernel trace |
| messages may be optionally enabled per tunnel and per session. Care is |
| needed when debugging a live system since the messages are not |
| rate-limited and a busy system could be swamped. Userspace uses |
| setsockopt on the PPPoX socket to set a debug mask. |
| |
| The following debug mask bits are available: |
| |
| PPPOL2TP_MSG_DEBUG verbose debug (if compiled in) |
| PPPOL2TP_MSG_CONTROL userspace - kernel interface |
| PPPOL2TP_MSG_SEQ sequence numbers handling |
| PPPOL2TP_MSG_DATA data packets |
| |
| If enabled, files under a l2tp debugfs directory can be used to dump |
| kernel state about L2TP tunnels and sessions. To access it, the |
| debugfs filesystem must first be mounted. |
| |
| # mount -t debugfs debugfs /debug |
| |
| Files under the l2tp directory can then be accessed. |
| |
| # cat /debug/l2tp/tunnels |
| |
| The debugfs files should not be used by applications to obtain L2TP |
| state information because the file format is subject to change. It is |
| implemented to provide extra debug information to help diagnose |
| problems.) Users should use the netlink API. |
| |
| /proc/net/pppol2tp is also provided for backwards compatibility with |
| the original pppol2tp driver. It lists information about L2TPv2 |
| tunnels and sessions only. Its use is discouraged. |
| |
| Unmanaged L2TPv3 Tunnels |
| ======================== |
| |
| Some commercial L2TP products support unmanaged L2TPv3 ethernet |
| tunnels, where there is no L2TP control protocol; tunnels are |
| configured at each side manually. New commands are available in |
| iproute2's ip utility to support this. |
| |
| To create an L2TPv3 ethernet pseudowire between local host 192.168.1.1 |
| and peer 192.168.1.2, using IP addresses 10.5.1.1 and 10.5.1.2 for the |
| tunnel endpoints:- |
| |
| # modprobe l2tp_eth |
| # modprobe l2tp_netlink |
| |
| # ip l2tp add tunnel tunnel_id 1 peer_tunnel_id 1 udp_sport 5000 \ |
| udp_dport 5000 encap udp local 192.168.1.1 remote 192.168.1.2 |
| # ip l2tp add session tunnel_id 1 session_id 1 peer_session_id 1 |
| # ifconfig -a |
| # ip addr add 10.5.1.2/32 peer 10.5.1.1/32 dev l2tpeth0 |
| # ifconfig l2tpeth0 up |
| |
| Choose IP addresses to be the address of a local IP interface and that |
| of the remote system. The IP addresses of the l2tpeth0 interface can be |
| anything suitable. |
| |
| Repeat the above at the peer, with ports, tunnel/session ids and IP |
| addresses reversed. The tunnel and session IDs can be any non-zero |
| 32-bit number, but the values must be reversed at the peer. |
| |
| Host 1 Host2 |
| udp_sport=5000 udp_sport=5001 |
| udp_dport=5001 udp_dport=5000 |
| tunnel_id=42 tunnel_id=45 |
| peer_tunnel_id=45 peer_tunnel_id=42 |
| session_id=128 session_id=5196755 |
| peer_session_id=5196755 peer_session_id=128 |
| |
| When done at both ends of the tunnel, it should be possible to send |
| data over the network. e.g. |
| |
| # ping 10.5.1.1 |
| |
| |
| Sample Userspace Code |
| ===================== |
| |
| 1. Create tunnel management PPPoX socket |
| |
| kernel_fd = socket(AF_PPPOX, SOCK_DGRAM, PX_PROTO_OL2TP); |
| if (kernel_fd >= 0) { |
| struct sockaddr_pppol2tp sax; |
| struct sockaddr_in const *peer_addr; |
| |
| peer_addr = l2tp_tunnel_get_peer_addr(tunnel); |
| memset(&sax, 0, sizeof(sax)); |
| sax.sa_family = AF_PPPOX; |
| sax.sa_protocol = PX_PROTO_OL2TP; |
| sax.pppol2tp.fd = udp_fd; /* fd of tunnel UDP socket */ |
| sax.pppol2tp.addr.sin_addr.s_addr = peer_addr->sin_addr.s_addr; |
| sax.pppol2tp.addr.sin_port = peer_addr->sin_port; |
| sax.pppol2tp.addr.sin_family = AF_INET; |
| sax.pppol2tp.s_tunnel = tunnel_id; |
| sax.pppol2tp.s_session = 0; /* special case: mgmt socket */ |
| sax.pppol2tp.d_tunnel = 0; |
| sax.pppol2tp.d_session = 0; /* special case: mgmt socket */ |
| |
| if(connect(kernel_fd, (struct sockaddr *)&sax, sizeof(sax) ) < 0 ) { |
| perror("connect failed"); |
| result = -errno; |
| goto err; |
| } |
| } |
| |
| 2. Create session PPPoX data socket |
| |
| struct sockaddr_pppol2tp sax; |
| int fd; |
| |
| /* Note, the target socket must be bound already, else it will not be ready */ |
| sax.sa_family = AF_PPPOX; |
| sax.sa_protocol = PX_PROTO_OL2TP; |
| sax.pppol2tp.fd = tunnel_fd; |
| sax.pppol2tp.addr.sin_addr.s_addr = addr->sin_addr.s_addr; |
| sax.pppol2tp.addr.sin_port = addr->sin_port; |
| sax.pppol2tp.addr.sin_family = AF_INET; |
| sax.pppol2tp.s_tunnel = tunnel_id; |
| sax.pppol2tp.s_session = session_id; |
| sax.pppol2tp.d_tunnel = peer_tunnel_id; |
| sax.pppol2tp.d_session = peer_session_id; |
| |
| /* session_fd is the fd of the session's PPPoL2TP socket. |
| * tunnel_fd is the fd of the tunnel UDP socket. |
| */ |
| fd = connect(session_fd, (struct sockaddr *)&sax, sizeof(sax)); |
| if (fd < 0 ) { |
| return -errno; |
| } |
| return 0; |
| |
| Internal Implementation |
| ======================= |
| |
| The driver keeps a struct l2tp_tunnel context per L2TP tunnel and a |
| struct l2tp_session context for each session. The l2tp_tunnel is |
| always associated with a UDP or L2TP/IP socket and keeps a list of |
| sessions in the tunnel. The l2tp_session context keeps kernel state |
| about the session. It has private data which is used for data specific |
| to the session type. With L2TPv2, the session always carried PPP |
| traffic. With L2TPv3, the session can also carry ethernet frames |
| (ethernet pseudowire) or other data types such as ATM, HDLC or Frame |
| Relay. |
| |
| When a tunnel is first opened, the reference count on the socket is |
| increased using sock_hold(). This ensures that the kernel socket |
| cannot be removed while L2TP's data structures reference it. |
| |
| Some L2TP sessions also have a socket (PPP pseudowires) while others |
| do not (ethernet pseudowires). We can't use the socket reference count |
| as the reference count for session contexts. The L2TP implementation |
| therefore has its own internal reference counts on the session |
| contexts. |
| |
| To Do |
| ===== |
| |
| Add L2TP tunnel switching support. This would route tunneled traffic |
| from one L2TP tunnel into another. Specified in |
| http://tools.ietf.org/html/draft-ietf-l2tpext-tunnel-switching-08 |
| |
| Add L2TPv3 VLAN pseudowire support. |
| |
| Add L2TPv3 IP pseudowire support. |
| |
| Add L2TPv3 ATM pseudowire support. |
| |
| Miscellaneous |
| ============= |
| |
| The L2TP drivers were developed as part of the OpenL2TP project by |
| Katalix Systems Ltd. OpenL2TP is a full-featured L2TP client / server, |
| designed from the ground up to have the L2TP datapath in the |
| kernel. The project also implemented the pppol2tp plugin for pppd |
| which allows pppd to use the kernel driver. Details can be found at |
| http://www.openl2tp.org. |