Skip to main content

Understanding coupling dynamics of public transportation networks

Abstract

Subway and bus networks work as an integrated multiplex transportation system and play an indispensable role in modern big cities. Even though a variety of works have investigated the coupling dynamics of multiplex transportation networks, empirical data that validates the determinant coupling factors are still lacking. In this paper, we employ smartcard data of 2.4 million subway and bus passengers in Shenzhen, China to study the coupling dynamics of subway and bus networks. Surprisingly, the coupling of subway and bus networks is not notably influenced by the time-varying speed ratio of the two network layers but is jointly determined by the distribution of travel demands and transportation facilities. Our findings highlight the important role of real travel demand data in analyzing the coupling dynamics of multiplex transportation networks. They also suggest that the speed ratio of different network layers, which was regarded as a key factor in determining coupling strength, has a negligible effect on travelers’ route selections, and thus the coupling dynamics of multiplex transportation networks.

1 Introduction

Multiplex networks are ubiquitous in modern human societies. Representative types of multiplex networks range from social networks [1] and biological networks [2] to power grids [3] and transportation networks [4, 5]. In multiplex networks, network layers are not isolated entities but interact with each other as an integrated system [6]. In recent years, a variety of new and interesting properties of percolation processes [7–11], cascade failures [8, 12], epidemic spreading [13–20], and coupling dynamics [21–24] on multiplex networks have been discovered. Coupling is a very important and interesting research topic because it occurs on real networks in our daily life. In big cities, several layers of transportation infrastructure work together to supply the continuously growing urban mobility demand [25]. The coupling of different layers of transportation infrastructure is generated by travelers using more than one mode of transportation to complete a trip. The most common examples include the “Park and Ride” mode [26] that generates coupling between road and subway networks and passenger transfer that generates coupling between bus and subway networks.

Previous works discovered a variety of interacting features of multiplex transportation networks. Morris and Barthélemy found that the shortest path length decreases as the coupling increases on a spatial network [21]. Gallotti and Barthélemy studied how the riding, walking and waiting time of trips vary with distance [22]. Zhou et al. proposed a layered network model for transportation systems [23]. The authors found the transition point of cooperation between layered networks [23]. Strano et al. analyzed the coupling property between street and subway networks in London and New York [24]. An optimal operation speed is determined for London subway [24]. How operation speed of subway influences passenger travel time and road traffic congestion was studied using CDR (Call Details Records) data [27]. The results indicated that travel demand plays crucial quantitative and qualitative roles [27]. How interconnections between network layers influence traffic congestion [28] and how speed ratio influences network capacity and travel time [29] were also investigated.

Despite the rapid developments in coupling dynamics of multiplex transportation networks, most existing works are based on simulated networks or simulated travel demands. This may lead to findings inconsistent with empirical evidence. First, the actual topology of a transportation network is usually much more complicated than the topology of a simulated network. Second, the obtained coupling results usually deviate from real situations when using uniformly (randomly) generated travel demands [27]. Third, mobile phone data [30, 31] cannot be used to capture the transfers between different network layers either. Finally, the inconvenience of transfers and individual preferences cannot be evaluated and incorporated in building the models. In summary, previous works had to make strong assumptions on travelers’ routes and mode-selection behavior due to the lack of data.

The recent emerging General Transit Feed Specification (GTFS) service provides both static and real-time transit data, which include transit route data, schedule data, real-time bus position data, etc. Geographic transit service data formatted in GTFS have been widely used in transportation analysis and applications [32]. Wong used GTFS data to analyze transit headways for single transport agency and multiple transport agencies [33]. Hadas employed GTFS data and geographic information data of transportation networks to analyze the connectivity indicators (network coverage, operation speed, intersection coverage, stop transfer potential and route overlap) of public transportation systems [34]. Tao studied the spatial-temporal travel behavior of bus passengers using GTFS data [35]. Lee et al. used GTFS data to investigate the symmetry of boarding and alighting in time and space [36]. Huang et al. combined GTFS data and taxi GPS data to identify and anticipate large crowd gatherings [37].

Using a set of data similar to GTFS data, we studied the coupling dynamics of the multiplex transportation network in Shenzhen, China. The geographic information systems (GIS) data of subway and bus networks, and the transit route data were used to obtain accurate topology of the multiplex transportation network. Smartcard data of bus and subway passengers were analyzed to estimate dynamical passenger travel demands. The GPS data of buses were used to estimate the road traffic condition. Real transfers between subway and bus networks can be captured using the real large-scale data. In the present study, we focus on one important question: What are the major determining factors for the coupling strength of subway and bus networks?

2 Data

In this section, we briefly introduce the datasets used. More details about the used data are provided in the Additional file 1.

The GIS data of subway and bus networks provide geographic locations of each bus stop and each subway station in Shenzhen, China (Fig. S6). In the studied multiplex public transportation network, the number of bus stops (9114 in total) is much larger than the number of subway stations (132 in total).

Bus transit route data and subway transit route data provide the stop information of all buses and the station information of all subway lines. The data also provide the basic attributes (length, running time) of the subway sections.

The GIS data of traffic zones in Shenzhen were provided by Shenzhen Transportation Authority. In this dataset, the geographic area of Shenzhen is partitioned into 996 traffic zones, within which 844 zones contain at least one bus stop or one subway station (Fig. 1(a)).

Figure 1
figure 1

Illustration of a multiplex transportation network. (a) Multiplex public transportation network of Shenzhen. Bus stops (light green dots), subway stations (red squares), transfer zones (orange polygons), ordinary zones (grey polygons), and four inter-links (dashed lines) between the two network layers are illustrated. (b) Path depicted in blue represents a unimodal subway trip. Path depicted in green represents a multimode trip using both bus and subway transportation. (c)–(e) SDNs of one destination zone during 7:00 a.m.–8:00 a.m., 1:00 p.m.–2:00 p.m., and 7:00 p.m.–8:00 p.m.

Smartcard data recorded the trips of more than 2.4 million smartcard users on 915 bus lines and 6 subway lines in Shenzhen (Figs. S1–S4). Each smartcard has a unique ID number. During the 1-week data collection period (August 8–15, 2016), 48 million transaction records were generated. The time and station that a passenger enters or exits the subway were recorded, whereas only the information of the bus boarded and the boarding time were recorded for bus passengers. The time that a bus arrived at each bus stop was inferred by matching the GPS coordinates of the bus to the stops along the bus route. This generates the timetable of the bus. Consequently, the boarding time of each bus passenger was compared with the timetable to identify the bus stop where each passenger got on board. Given that bus passengers do not tap cards when getting off a bus, the destination information is not available. In the following section, we use a four-step method to infer the destination of each bus trip.

3 Methods

3.1 Estimating the destination of a bus trip

We propose a four-step method to estimate the destinations of the bus trips. The method is elaborated as follows:

Step 1: Two assumptions proposed by Barry et al. [38] are used: (1) a passenger will start a new trip near the destination of the current trip. Therefore, the bus stop which is closest to (and within 2 km of) the boarding stop or station of the next trip is inferred as the destination stop of the current bus trip. Here, 2 km is selected as the maximum radius to search candidate stops, because previous works [38–41] suggest that the maximum radius of individual walking activities is 1–2 km. (2) at the end of a day, a passenger will return to the origin of his/her first trip of the day. Therefore, the bus stop which is closest to (and within 2 km of) the boarding stop or station of the first trip of a day is inferred as the destination stop of the last trip of the day.

Step 2: If a passenger only has one bus trip during a day, the bus stop which is closest to (and within 2 km of) the origin of the first trip in the next day is inferred as the destination [39]. If for a passenger the destination of the last bus trip cannot be found within 2 km of the origin of the first trip of the day, the bus stop which is closest to (and within 2 km of) the origin of the first trip in the next day is inferred as the destination [39].

Step 3: If the destination of a bus trip cannot be identified in Step 1 and 2, the historical smartcard records of the bus passenger [39] are used. Bus trips that share the same route, same boarding stop, and similar boarding time with the unidentified bus trip are analyzed. The similar historical bus trip that has the most similar boarding time to the unidentified trip is identified. And the destination of the historical bus trip is inferred as the destination of the unidentified bus trip.

Step 4: If the destination of a bus trip cannot be identified in Step 1, 2 and 3 (approximately 9% of all bus trips), the most frequent alighting bus stop after the boarding stop is selected as the destination of the bus trip.

Using the proposed method, destinations are identified for 97% of the bus trips. Once the destination of a bus trip is estimated, we infer the alighting time of the bus passenger according to the arrival time of the bus at the bus stop.

3.2 Identifying transfers in the multiplex transportation network

The methods to identify transfers can be classified into two types [40–48]. One method uses the time interval between the starting times of two consecutive trips as the criterion to determine if two consecutive trips belong to a whole trip. The other method uses the time interval between the end time of the first trip and the starting time of the following second trip as the criterion. The advantage of the first method is that the estimation of the time interval is accurate because the boarding time of a bus or subway passenger is always recorded. However, this method may not work well when the travel time of the first trip is too short or too long. The second method avoids the drawback of the first method, however, the second method depends on accurate estimation of the destinations of bus trips. Here, we combine the two methods to identify transfers:

Step 1: If the time interval between the end of the first trip and the start of the second trip is within 30 min, a transfer is identified between the two consecutive trips; otherwise, the two trips are identified as separate trips.

Step 2: If the destination information of the first trip is not available, another criterion is applied. If the time interval between the starting times of two consecutive trips is within 60 min, a transfer is identified between the two consecutive trips; otherwise, the two trips are identified as separate trips.

In the present study, only multimode trips with transfers between subway network and bus network are used to estimate the coupling strength between subway network and bus network.

3.3 Generating time-varying single-destination networks

Each observational day is split into 17 1-hour time windows from 6:00 a.m. to 10:00 p.m. that cover the main service period of public transportation in Shenzhen. Passenger trips are grouped into a time window according to the time (1-hour time window) when the trip started. Single-destination network (SDN) of each time window is generated based on the trips started during the time window (Figs. 1(c)–(e)). In an SDN, nodes represent traffic zones, and an origin zone is connected to the unique destination zone if at least one qualified trip is observed between the two zones during the time window. Qualified trips include:

  1. (a)

    Multimode trips that use both bus and subway services.

  2. (b)

    Bus trips with alternative multimode routes connecting the origin and destination zones.

  3. (c)

    Subway trips with alternative multimode routes connecting the origin and destination zones.

These criteria ensure that all unimodal trips have alternative multimode routes to the destination zone. In the following analysis, only qualified trips are used. For each SDN, there are several origin zones and one destination zone. For each destination zone, there are 17 SDNs generated during a day.

In the present study, traffic zones containing at least one bus stop and one subway station are defined as transfer zones (124 in total), and the other traffic zones are defined as ordinary zones (Fig. 1(b)). Here, we only analyzed the SDNs with destinations located at transfer zones (Fig. S5). This is because both bus-to-subway trips and subway-to-bus trips exist in these SDNs. While for the SDNs with destinations located at ordinary zones, only bus-to-subway trips or subway-to-bus trips can be observed. The transfer zones are popular destinations, roughly 70% of the trips are captured by the 124 SDNs.

4 Results

For each SDN i of a time window t, we measure the coupling strength \(\lambda_{i} ( t )\) between the subway network and the bus network as

$$ \lambda_{i} ( t ) = \frac{n_{i} (t)^{\mathrm{coupled}}}{n_{i} (t)}, $$
(1)

where \(n_{i} (t)^{\mathrm{coupled}}\) and \(n_{i} (t)\) denote the number of multimode trips and the total number of trips in i. Given that SDNs may have different scales of coupling strengths, we normalize the coupling strength \(\lambda_{i} ( t )\) of each SDN when analyzing the temporal patterns of \(\lambda_{i} ( t )\):

$$ \lambda_{i}^{\mathrm{norm}} ( t ) = \frac{\lambda_{i} ( t ) - \lambda_{i}^{\mathrm{min}}}{\lambda_{i}^{\mathrm{max}} - \lambda_{i}^{\mathrm{min}}}, $$
(2)

where \(\lambda_{i}^{\mathrm{max}}\) and \(\lambda_{i}^{\mathrm{min}}\) denote the maximum and minimum values of \(\lambda_{i} ( t )\) in the day.

Using the smartcard data and the bus GPS data, we estimate the time-varying operation speeds of subway network and bus network for each SDN. We find that the operation speed of subway is stable during the whole day, whereas the operation speeds of buses are influenced by road traffic conditions (Fig. S7). Here, the speed ratio of an SDN i during time window t is defined as the average trip speed on the subway network over the average trip speed on the bus network. The average speed of a trip is calculated using the trip distance divided by the travel time:

$$\begin{aligned} &\overline{v}_{i}^{s} ( t ) = \frac{1}{N_{s}} \sum v_{i,j}^{s}, \end{aligned}$$
(3)
$$\begin{aligned} &\overline{v}_{i}^{b} ( t ) = \frac{1}{N_{b}} \sum v_{i,k}^{b}, \end{aligned}$$
(4)
$$\begin{aligned} &\beta_{i} ( t ) = \frac{\overline{v}_{i}^{s} ( t )}{ \overline{v}_{i}^{b} ( t )}, \end{aligned}$$
(5)

where \(v_{i,j}^{s}\) is the speed of subway trip j and \(v_{i,k}^{b}\) the speed of bus trip k, \(\overline{v}_{i}^{s} ( t )\) and \(\overline{v}_{i}^{b} ( t )\) are the average speed on the subway layer and bus layer of SDN i during time window t. When comparing \(\beta_{i} ( t )\) of different SDNs, we normalize the speed ratio using the maximum–minimum normalization method:

$$ \beta_{i}^{\mathrm{norm}} ( t ) = \frac{\beta_{i} ( t ) - \beta_{i}^{\mathrm{min}}}{\beta_{i}^{\mathrm{max}} - \beta_{i}^{\mathrm{min}}}, $$
(6)

where \(\beta_{i}^{\mathrm{max}}\) and \(\beta_{i}^{\mathrm{min}}\) denote the maximum and minimum values of \(\beta_{i} ( t )\) in the day.

4.1 Preliminary observation on three (case studies) SDNs

Three SDNs with destination zones located at a commercial zone (SDN 1), a transportation hub zone (SDN 2), and a residential zone (SDN 3) are analyzed (Fig. 2). The coupling strengths \(\lambda_{i}^{\mathrm{norm}} ( t )\) of the SDN 1 and SDN 2 reached the maximum in the morning rush hours (6:00 a.m.–7:00 a.m.), whereas the coupling strength of SDN 3 reached the maximum in the evening rush hours (5:00 p.m.–7:00 p.m.) (Figs. 2(a)–(c)). The speed ratio \(\beta_{i}^{\mathrm{norm}} ( t )\) is larger in the morning and evening rush hours for all three SDNs, because the operation speeds of buses are lower in peak hours. The speed ratio \(\beta_{i}^{\mathrm{norm}} ( t )\) was considered as the key factor influencing the mode-selection behavior of travelers and, thus, the coupling strength of layered networks [21, 24, 29]. However, our empirical results indicate that passengers did not react to speed ratio changes (Figs. 2(d)–(f)). No strong correlation is found between coupling strength \(\lambda_{i}^{\mathrm{norm}} ( t )\) and speed ratio \(\beta_{i}^{\mathrm{norm}} ( t )\) for all three SDNs (Figs. 3(a), (d), (g)).

Figure 2
figure 2

Temporal features of coupling strength \(\lambda_{i}^{\mathrm{norm}} (t )\) and speed ratio \(\beta_{i}^{\mathrm{norm}} ( t )\). SDNs with destinations at a commercial zone (SDN 1) (panel (a), (d)), a transportation hub zone (SDN 2) (panel (b), (e)) and a residential zone (SDN 3) (panel (c), (f))

Figure 3
figure 3

Pearson Correlation Coefficient (PCC) between coupling strength and three indexes. PCC between coupling strength and speed ratio (left column), PCC between coupling strength and average trip distance (middle column), and PCC between coupling strength and fraction of travelers from ordinary zones (right column) for SDNs with destinations at (a)–(c) a commercial zone, (d)–(f) a transportation hub zone and (g)–(i) a residential zone. Normalized parameters are denoted \(\lambda^{\mathrm{norm}}\), \(\beta^{\mathrm{norm}}\), \(d^{\mathrm{norm}}\), and \(p^{\mathrm{norm}}\). Blue lines denote the regression functions

One intuition is that as trip distance increases, a passenger is more likely to use both bus network and subway network. Therefore, the correlation between average trip distance and coupling strength is analyzed. Here, average trip distance is defined as the average Euclidean distance from each origin to the single destination of an SDN, weighted by the number of trips from each origin:

$$ d_{i} ( t ) = \sum_{o\in O_{i} (t)} \frac{n_{o_{i}} (t)}{n_{i} (t)} d_{oi}, $$
(7)

where \(O_{i} (t)\) represents the set of origins of SDN i during time window t, \(n_{o_{i}} (t)\) represents the number of trips from origin o, and \(d_{oi}\) represents the Euclidean distance between o and the destination zone. We normalize the average trip distance using the maximum–minimum normalization method:

$$ d_{i}^{\mathrm{norm}} ( t ) = \frac{d_{i} ( t ) - d_{i}^{\mathrm{min}}}{ d_{i}^{\mathrm{max}} - d_{i}^{\mathrm{min}}}, $$
(8)

where \(d_{i}^{\mathrm{max}}\) and \(d_{i}^{\mathrm{min}}\) denote the maximum and minimum values of \(d_{i} ( t )\) in the day. We observe good positive correlations between coupling strength \(\lambda_{i}^{\mathrm{norm}} ( t )\) and average trip distance \(d_{i}^{\mathrm{norm}} ( t )\) for SDN 1 and SDN 3 (Figs. 3(b), (h)). However, the correlation between \(\lambda_{i}^{\mathrm{norm}} ( t )\) and \(d_{i}^{\mathrm{norm}} ( t )\) is weak for the SDN 2 (Fig. 3(e)). Possible explanations are: there are abundant direct public transportation service to the transportation hub, and the transportation hub serves more as the destination rather than the transfer place for passengers (44,931 trips vs. 5262 trips on Monday).

Another experience in our daily life is that trips originating from suburban areas may have a higher possibility of using both subway and bus transportation. This is because no subway service is available in suburban areas, and travelers may first use bus transportation to reach a subway station. Therefore, we define the origin composition index p, the fraction of trips from ordinary zones:

$$ p_{i} ( t ) = \frac{n_{i} (t)^{\mathrm{ordinary}}}{n_{i} (t)}, $$
(9)

where \(n_{i} ( t )^{\mathrm{ordinary}}\) and \(n_{i} (t)\) denote the number of trips originating from ordinary zones and the total number of trips of SDN i in time window t. We normalize \(p_{i} ( t )\) using the maximum–minimum normalization method:

$$ p_{i}^{\mathrm{norm}} ( t ) = \frac{p_{i} ( t ) - p_{i}^{\mathrm{min}}}{ p_{i}^{\mathrm{max}} - p_{i}^{\mathrm{min}}}, $$
(10)

where \(p_{i}^{\mathrm{max}}\) and \(p_{i}^{\mathrm{min}}\) denote the maximum and minimum values of \(p_{i} ( t )\) in the day. Strong positive correlations are observed between \(\lambda_{i}^{\mathrm{norm}} ( t )\) and \(p_{i}^{\mathrm{norm}} ( t )\) for SDN 1 and SDN 2 (Figs. 3(c), (f)). However, a negative correlation is observed for SDN 3 (Fig. 3(i)). Possible reason could be that the trip distance is relatively short (the average trip distance is 8 km during a day). Even when a large fraction of passengers come from ordinary zones, they can use direct public transportation service or the travel time cannot be obviously reduced when transferring to the faster subway network.

4.2 Statistical analysis on all 124 SDNs

The results of the three case studies indicate that no single index can determine the coupling strength of all SDNs. In addition, different land use features may influence the route and mode selection behavior of travelers. In the present study, without confident and accurate land use information, we focus on the general trend rather than the case feature of the determining factor of the coupling strength.

We first find that average coupling strength \(\langle \lambda_{i}^{\mathrm{norm}} ( t ) \rangle\) is high in morning rush hours and decreases afterwards for both weekdays and weekends (Fig. 4(a)). This can be explained by that most studied SDNs attract early morning commuters who must use multimode transport to ensure punctuality. Compared with weekday results, \(\langle \lambda_{i}^{\mathrm{norm}} ( t ) \rangle\) is lower in morning, but larger for other time windows on weekends. This can be explained by that there are less commuters but more afternoon activities on weekends.

Figure 4
figure 4

Temporal patterns of (a) coupling strength \(\langle \lambda^{\mathrm{norm}} \rangle\), (b) speed ratio \(\langle \beta^{\mathrm{norm}} \rangle\), (c) average trip distance \(\langle d^{\mathrm{norm}} \rangle\), and (d) fraction of trips from ordinary zones \(\langle p^{\mathrm{norm}} \rangle\). The \(\langle \dots \rangle\) symbol denotes the average of the index on all studied SDNs. Error bars represent the variance of the index values

The average speed ratios on weekdays and Saturday have similar temporal patterns. The speed ratio did not have a peak on Sunday evening (Fig. 4(b)). The coupling strength did not increase with the speed ratio during evening rush hours (Figs. 4(a), (b)). No prominent correlation is found between the coupling strength and the speed ratio on the studied 124 SDNs (Fig. 5(a)). The average trip distances on the studied SDNs show similar patterns for all weekdays and weekends (Fig. 4(c)). We observed a moderate positive correlation between coupling strength \(\langle \lambda_{i}^{\mathrm{norm}} ( t ) \rangle\) and average trip distance \(\langle d_{i}^{\mathrm{norm}} ( t ) \rangle\) (Fig. 5(b)). The origin composition index \(\langle p_{i}^{\mathrm{norm}} ( t ) \rangle\) decreases slightly faster than \(\langle \lambda_{i}^{\mathrm{norm}} ( t ) \rangle\) after the morning rush hours (Fig. 4(d)). A strong positive correlation is observed between \(\langle \lambda_{i}^{\mathrm{norm}} ( t ) \rangle\) and \(\langle p_{i}^{\mathrm{norm}} ( t ) \rangle\) (Fig. 5(c)). This finding suggests that \(\langle p_{i}^{\mathrm{norm}} ( t ) \rangle\) could be a dominant factor determining the coupling strength, though negative correlations between \(\lambda_{i}^{\mathrm{norm}} ( t )\) and \(p_{i}^{\mathrm{norm}} ( t )\) may exist in a few SDNs (Fig. 3(i), Figs. 7(c), (f)).

Figure 5
figure 5

Pearson Correlation Coefficient (PCC) between coupling strength and (a) speed ratio, (b) average trip distance, and (c) fraction of travelers from ordinary zones. The average value of each index across all studied SDNs and time windows is calculated. The blue line in (c) denotes the linear regression function

Using Monday and Saturday as examples, temporal distributions of all indexes for each SDN are analyzed (Fig. 6). Pearson Correlation Coefficient (PCC) between \(\lambda_{i}^{\mathrm{norm}} ( t )\) and \(\beta_{i}^{\mathrm{norm}} ( t )\), PCC between \(\lambda_{i}^{\mathrm{norm}} ( t )\) and \(d_{i}^{\mathrm{norm}} ( t )\), and PCC between \(\lambda_{i}^{\mathrm{norm}} ( t )\) and \(p_{i}^{\mathrm{norm}} ( t )\) are measured (Fig. 7). The speed ratio index and the origin composition index of all SDNs follow similar trends. While several SDNs show different trends for the coupling index and the distance index (Fig. 6). As shown in Fig. 7, origin composition index \(p_{i}^{\mathrm{norm}} ( t )\) shows strong positive correlation with coupling strength \(\lambda_{i}^{\mathrm{norm}} ( t )\) for most SDNs, while fewer SDNs are found to have such strong correlation between distance index \(d_{i}^{\mathrm{norm}} ( t )\) and coupling index \(\lambda_{i}^{\mathrm{norm}} ( t )\). The speed index \(\beta_{i}^{\mathrm{norm}} ( t )\), which was considered a key factor in coupling dynamic studies, has no prominent correlation with coupling index \(\lambda_{i}^{\mathrm{norm}} ( t )\) for the majority of SDNs.

Figure 6
figure 6

Temporal features of index values. The red line denotes the average index value and the light gray line denotes the index value of each SDN. (a), (c) Coupling strength \(\lambda^{\mathrm{norm}}\); (b), (f) Speed ratio \(\beta^{\mathrm{norm}}\); (c), (g) Average trip distance \(d^{\mathrm{norm}}\); (d), (h) Fraction of trips from ordinary zones \(p^{\mathrm{norm}}\)

Figure 7
figure 7

Distribution of PCCs between coupling strength and each index for all studied SDNs on Monday and Saturday. Coupling strength \(\lambda^{\mathrm{norm}}\) vs. (a), (d) speed ratio \(\beta^{\mathrm{norm}}\); (b), (e) average trip distance \(d^{\mathrm{norm}}\); (c), (f) the fraction of travelers from ordinary zones \(p^{\mathrm{norm}}\)

The PCC values were also illustrated in space using the destination zone of each SDN (Fig. 8). No prominent spatial pattern is found for the correlation between speed ratio index \(\beta_{i}^{\mathrm{norm}} ( t )\) and coupling index \(\lambda_{i}^{\mathrm{norm}} ( t )\) (Figs. 8(a), (b)). Passengers seem to have a random reaction with the time-varying travel speed on different SDNs. Interestingly, the destination zones of SDNs with strong positive correlations between distance index \(d_{i}^{\mathrm{norm}} ( t )\) and coupling index \(\lambda_{i}^{\mathrm{norm}} ( t )\) are located in the central urban area. The destination zones of SDNs with weak or negative correlations between \(d_{i}^{\mathrm{norm}} ( t )\) and \(\lambda_{i}^{\mathrm{norm}} ( t )\) are located in the periphery area (Figs. 8(c), (d)). The peripheral destination zones attract long-distance travelers with less use of multimode transport, which is probably caused by that these zones attract many long-distance travelers from the city center with wide accessibility to public transportation. The destination zones of SDNs with strong positive correlations between \(p_{i}^{\mathrm{norm}} ( t )\) and \(\lambda_{i}^{\mathrm{norm}} ( t )\) are widely distributed across the investigated area. A few exceptions with weak correlations between \(p_{i}^{\mathrm{norm}} ( t )\) and \(\lambda_{i}^{\mathrm{norm}} ( t )\) are also found. This implies that though origin composition index \(p_{i}^{\mathrm{norm}} ( t )\) is the dominant factor determining the coupling strength, the coupling strength is also determined by other factors.

Figure 8
figure 8

Spatial distribution of destination zones of SDNs with different levels of correlations between (a), (b) \(\lambda^{\mathrm{norm}}\) and \(\beta^{\mathbf{norm}}\), (c), (d) \(\lambda^{\mathbf{norm}}\) and \(d^{\mathbf{norm}}\), (e), (f) \(\lambda^{\mathbf{norm}}\) and \(p^{\mathbf{norm}}\)

4.3 A new index showing higher correlations with coupling strength

For SDNs that have moderate or weak correlations between \(p_{i}^{\mathrm{norm}} ( t )\) and \(\lambda_{i}^{\mathrm{norm}} ( t )\), we suppose that the distance index can be a secondary factor to influence the coupling. We respectively analyze the fraction of multimode trips \(f ( l )\) for travelers originating from transfer zones and ordinary zones (Fig. 9). On one hand, multimode trips originating from transfer zones account for less than 2% of the overall multimode trips, independent of trip distance l. Hence, two modes of transportation are barely used in trips connecting two zones with subway stations. This is consistent with our intuition. Subway usually has a faster speed than buses. Even when a mixing use of subway and bus is faster than a unimodal subway trip, travelers are not likely to do so due to the inconvenience of transfers [49, 50]. On the other hand, for trips originating from ordinary zones, multimode trips are increasingly favored as the trip distance increases. The finding can be explained by the decreasing availability of direct buses to destinations and the increasing attraction of reduced travel times provided by the subway. A transition point, however, is observed when the trip distance exceeds 33 km. It is probably because multimode trips are replaced by customized bus lines connecting major remote districts.

Figure 9
figure 9

Fraction of multimode trips \(f ( l)\) vs. trip distance l. The blue dot represents the fraction of travelers originating from ordinary zones using multimode transport. The red dot represents the fraction of travelers originating from transfer zones using multimode transport

Taken together, the use of multimode transport majorly exists within trips from ordinary zones to transfer zones. Neither the trip distance nor the speed ratio has a large impact on passengers’ routing behavior for trips originating from transfer zones. While for travelers traveling from ordinary zones to transfer zones, the trip distance plays an important role in determining whether to use a multimode transport. These findings motivate us to incorporate the distance of trips originating from ordinary zones into the origin composition index \(p_{i} ( t )\). Hence, we propose a new index \(\xi_{i} ( t )\), the trip distance weighted fraction of passengers from ordinary zones. We use the regression function \(f ( l )\) shown in Fig. 9 to calculate the weight for the origin composition index \(p_{i} ( t )\):

$$\begin{aligned} &w_{i} ( t ) = \frac{-0.000797 ( d_{i}^{\mathrm{ord}} ( t ) )^{2} +0.05 d_{i}^{\mathrm{ord}} (t)-0.13}{f_{\mathrm{max}}}, \end{aligned}$$
(11)
$$\begin{aligned} &\xi_{i} ( t ) = w_{i} ( t ) \times p_{i} ( t ), \end{aligned}$$
(12)

where \(d_{i}^{\mathrm{ord}} ( t )\) is the average distance of trips from ordinary zones on SDN i and \(f_{\mathrm{max}}\) the local maximum value of \(f ( l )\) show in Fig. 9. We normalize the weighted origin composition index for each SDN using the maximum–minimum normalization method:

$$ \xi_{i}^{\mathrm{norm}} ( t ) = \frac{\xi_{i} ( t ) - \xi_{i}^{\mathrm{min}}}{\xi_{i}^{\mathrm{max}} - \xi_{i}^{\mathrm{min}}}, $$
(13)

where \(\xi_{i}^{\mathrm{max}}\) and \(\xi_{i}^{\mathrm{min}}\) denote the maximum and minimum values of \(\xi_{i} ( t )\) in the day. We obtained higher correlations between the calibrated index \(\xi_{i} ( t )\) and coupling strength \(\lambda_{i} ( t )\) for most SDNs (Fig. 10). Table 1 summarizes the average PCC values between coupling strength and the studied indexes and the \(R^{2}\) values of linear fits between coupling strength and the indexes. In general, the proposed index \(\xi_{i} ( t )\) improves the correlation between coupling index \(\lambda_{i} ( t )\) and origin composition index \(\xi_{i} ( t )\).

Figure 10
figure 10

PCCs between coupling strength and average trip distance weighted fraction of travelers from ordinary zones

Table 1 Average PCC values and \(R^{2}\) values between measured indexes across studied SDNs

5 Conclusions and discussion

In summary, we found that the coupling between the studied bus network and subway network is mainly generated by the long-distance trips originating from ordinary zones. Multimode trips are favored from an ordinary zone to a transfer zone when trip distances are long. Counterintuitively, the coupling strength of the studied subway network and bus network is weakly correlated with the speed ratio of the two networks. This can be caused by that passengers do not have a full view of global traffic states. This can be also caused by that passengers select the routes according to personal favor or local traffic conditions.

The present study is limited by the lack of data in diverse cities of different sizes and different transportation structures. However, we believe the results will have a lasting impact in the research areas of multiplex networks and urban transportation. We expect our work will attract the interest of researchers from related research communities. Similar analyses can be done for other cities to generalize the findings and open new research directions. For example, understanding the microcosmic route selection behavior of passengers can help understand why passengers use buses or multimode transport between two transfer zones with subway connection; investigating the land use types of destination zones can help further understand different temporal features of travel demand to specific destination areas.

Nowadays, many operational practices, such as the synchronization of public transport schedules, are implemented to improve the connections between transportation networks and attract more people to public transportation. However, our results imply that the use of multiplex transportation networks is not determined by the time-varying service conditions but by the distribution of mobility demand and transportation facilities. In other words, most passengers select two networks to complete their trips, because they have no direct transportation services to their destinations. Therefore, the coupling strength of two network layers will increase when the mobility demands of passengers from long-distant ordinary zones increase. These mobility demands are actually caused by the lack of necessary transportation facilities. Our finding highlights the need for a better understanding of passenger traffic demand, and more attention needs to be paid to the planning of multiplex transportation networks rather than to the operational aspects of current practices.

Abbreviations

GIS:

Geographic Information System

CDR:

Call Detail Records

GPS:

Global Positioning System

SDN:

Single-destination Network

PCC:

Pearson Correlation Coefficient

References

  1. Szell M, Lambiotte R, Thurner S (2010) Multirelational organization of large-scale social networks in an online world. Proc Natl Acad Sci USA 107:13636–13641. https://doi.org/10.1073/pnas.1004008107

    Article  Google Scholar 

  2. Bullmore E, Sporns O (2009) Complex brain networks: graph theoretical analysis of structural and functional systems. Nat Rev Neurosci 10:312. https://doi.org/10.1038/nrn2618

    Article  Google Scholar 

  3. Rosato V, Issacharoff L, Tiriticco F et al. (2008) Modelling interdependent infrastructures using interacting dynamical models. Int J Crit Infrastructures 4:63. https://doi.org/10.1504/IJCIS.2008.016092

    Article  Google Scholar 

  4. Kurant M, Thiran P (2006) Layered complex networks. Phys Rev Lett 96:138701. https://doi.org/10.1103/PhysRevLett.96.138701

    Article  Google Scholar 

  5. Zou SR, Zhou T, Liu AF et al. (2010) Topological relation of layered complex networks. Phys Lett A 374:4406–4410. https://doi.org/10.1016/j.physleta.2010.08.073

    Article  MATH  Google Scholar 

  6. Radicchi F, Arenas A (2013) Abrupt transition in the structural formation of interconnected networks. Nat Phys 9:717–720. https://doi.org/10.1038/nphys2761

    Article  Google Scholar 

  7. Buldyrev SV, Parshani R, Paul G et al. (2010) Catastrophic cascade of failures in interdependent networks. Nature 464:1025–1028. https://doi.org/10.1038/nature08932

    Article  Google Scholar 

  8. Parshani R, Buldyrev SV, Havlin S (2011) Critical effect of dependency groups on the function of networks. Proc Natl Acad Sci USA 108:1007–1010. https://doi.org/10.1073/pnas.1008404108

    Article  Google Scholar 

  9. Gao J, Buldyrev SV, Stanley HE, Havlin S (2011) Networks formed from interdependent networks. Nat Phys 8:40–48. https://doi.org/10.1038/nphys2180

    Article  Google Scholar 

  10. Lee KM, Kim JY, Cho WK et al. (2012) Correlated multiplexity and connectivity of multiplex random networks. New J Phys 14:033027. https://doi.org/10.1088/1367-2630/14/3/033027

    Article  Google Scholar 

  11. Brummitt CD, D’Souza RM, Leicht EA (2012) Suppressing cascades of load in interdependent networks. Proc Natl Acad Sci USA 109:E680–E689. https://doi.org/10.1073/pnas.1110586109

    Article  Google Scholar 

  12. Brummitt CD, Lee KM, Goh KI (2012) Multiplexity-facilitated cascades in networks. Phys Rev E, Stat Nonlinear Soft Matter Phys 85:045102. https://doi.org/10.1103/PhysRevE.85.045102

    Article  Google Scholar 

  13. Ruan Z, Tang M, Liu Z (2012) Epidemic spreading with information-driven vaccination. Phys Rev E, Stat Nonlinear Soft Matter Phys 86:036117. https://doi.org/10.1103/PhysRevE.86.036117

    Article  Google Scholar 

  14. Ruan Z, Hui P, Lin H, Liu Z (2013) Risks of an epidemic in a two-layered railway-local area traveling network. Eur Phys J B 86:13. https://doi.org/10.1140/epjb/e2012-30292-x

    Article  MathSciNet  Google Scholar 

  15. Granell C, GĂłmez S, Arenas A (2013) Dynamical interplay between awareness and epidemic spreading in multiplex networks. Phys Rev Lett 111:128701. https://doi.org/10.1103/PhysRevLett.111.128701

    Article  Google Scholar 

  16. Funk S, Jansen VAA (2010) Interacting epidemics on overlay networks. Phys Rev E, Stat Nonlinear Soft Matter Phys 81:036118. https://doi.org/10.1103/PhysRevE.81.036118

    Article  Google Scholar 

  17. Saumell-Mendiola A, Serrano MA, Boguñá M (2012) Epidemic spreading on interconnected networks. Phys Rev E, Stat Nonlinear Soft Matter Phys 86:026106. https://doi.org/10.1103/PhysRevE.86.026106

    Article  Google Scholar 

  18. Funk S, Gilad E, Watkins C, Jansen VAA (2009) The spread of awareness and its impact on epidemic outbreaks. Proc Natl Acad Sci USA 106:6872–6877. https://doi.org/10.1073/pnas.0810762106

    Article  MATH  Google Scholar 

  19. Dickison M, Havlin S, Stanley HE (2012) Epidemics on interconnected networks. Phys Rev E, Stat Nonlinear Soft Matter Phys 85:066109. https://doi.org/10.1103/PhysRevE.85.066109

    Article  Google Scholar 

  20. Wang W, Tang M, Yang H et al. (2014) Asymmetrically interacting spreading dynamics on complex layered networks. Sci Rep 4:5097. https://doi.org/10.1038/srep05097

    Article  Google Scholar 

  21. Morris RG, Barthelemy M (2012) Transport on coupled spatial networks. Phys Rev Lett 109:128703. https://doi.org/10.1103/PhysRevLett.109.128703

    Article  Google Scholar 

  22. Gallotti R, Barthelemy M (2015) Anatomy and efficiency of urban multimodal mobility. Sci Rep 4:6911. https://doi.org/10.1038/srep06911

    Article  Google Scholar 

  23. Gu CG, Zou SR, Xu XL et al. (2011) Onset of cooperation between layered networks. Phys Rev E, Stat Nonlinear Soft Matter Phys 84:026101. https://doi.org/10.1103/PhysRevE.84.026101

    Article  Google Scholar 

  24. Strano E, Shai S, Dobson S, Barthelemy M (2015) Multiplex networks in metropolitan areas: generic features and local effects. J R Soc Interface 12:20150651. https://doi.org/10.1098/rsif.2015.0651

    Article  Google Scholar 

  25. Sun L, Liu L, Xu Z et al. (2015) Locating inefficient links in a large-scale transportation network. Physica A 419:537–545. https://doi.org/10.1016/j.physa.2014.10.066

    Article  Google Scholar 

  26. Noel EC (1988) Park-and-ride: alive, well, and expanding in the United States. J Urban Plan Dev. http://ascelibrary.org.ezproxy.ucn.cl/doi/pdf/10.1061/%28ASCE%290733-9488%281988%29114%3A1%282%29

  27. Chodrow PS, Al-Awwad Z, Jiang S, González MC (2016) Demand and congestion in multiplex transportation networks. PLoS ONE 11:1–10. https://doi.org/10.1371/journal.pone.0161738

    Article  Google Scholar 

  28. Tan F, Wu J, Xia Y, Tse CK (2014) Traffic congestion in interconnected complex networks. Phys Rev E, Stat Nonlinear Soft Matter Phys 89:062813. https://doi.org/10.1103/PhysRevE.89.062813

    Article  Google Scholar 

  29. Du WB, Zhou XL, Chen Z et al. (2014) Traffic dynamics on coupled spatial networks. Chaos Solitons Fractals 68:72–77. https://doi.org/10.1016/j.chaos.2014.07.009

    Article  MATH  Google Scholar 

  30. He K, Xu Z, Wang P et al. (2016) Congestion avoidance routing based on large-scale social signals. IEEE Trans Intell Transp Syst 17:2613–2626. https://doi.org/10.1109/TITS.2015.2498186

    Article  Google Scholar 

  31. Wang P, Hunter T, Bayen AM et al. (2012) Understanding road usage patterns in urban areas. Sci Rep 2:1001. https://doi.org/10.1038/srep01001

    Article  Google Scholar 

  32. Ma T, Motta G, Liu K (2017) Delivering real-time information services on public transit: a framework. IEEE Trans Intell Transp Syst 18:2642–2656. https://doi.org/10.1109/TITS.2017.2656387

    Article  Google Scholar 

  33. Wong J (2013) Leveraging the general transit feed specification for efficient transit analysis. Transp Res Rec 2338:11–19. https://doi.org/10.3141/2338-02

    Article  Google Scholar 

  34. Hadas Y (2013) Assessing public transport systems connectivity based on Google transit data. J Transp Geogr 33:105–116. https://doi.org/10.1016/j.jtrangeo.2013.09.015

    Article  Google Scholar 

  35. Tao S, Rohde D, Corcoran J (2014) Examining the spatial-temporal dynamics of bus passenger travel behaviour using smart card data and the flow-comap. J Transp Geogr 41:21–36. https://doi.org/10.1016/j.jtrangeo.2014.08.006

    Article  Google Scholar 

  36. Lee SG, Hickman M (2013) Are transit trips symmetrical in time and space? Transp Res Rec 2382:173–180. https://doi.org/10.3141/2382-19

    Article  Google Scholar 

  37. Huang Z, Wang P, Zhang F, Gao J, Schich M (2018) A mobility network approach to identify and anticipate large crowd gatherings. Transp Res, Part B, Methodol 114:147–170. https://doi.org/10.1016/j.trb.2018.05.016

    Article  Google Scholar 

  38. Barry JJ, Newhouser R, Rahbee A, Sayeda S (2007) Origin and destination estimation in New York city with automated fare system data. Transp Res Rec 1817:183–187. https://doi.org/10.3141/1817-24

    Article  Google Scholar 

  39. Trépanier M, Tranchant N, Chapleau R (2007) Individual trip destination estimation in a transit smart card automated fare collection system. J Intell Transp Syst 11:1–14. https://doi.org/10.1080/15472450601122256

    Article  Google Scholar 

  40. Munizaga MA, Palma C (2012) Estimation of a disaggregate multimodal public transport origin-destination matrix from passive smartcard data from Santiago, Chile. Transp Res, Part C, Emerg Technol 24:9–18. https://doi.org/10.1016/j.trc.2012.01.007

    Article  Google Scholar 

  41. Munizaga M, Devillaine F, Navarrete C, Silva D (2014) Validating travel behavior estimated from smartcard data. Transp Res, Part C, Emerg Technol 44:70–79. https://doi.org/10.1016/j.trc.2014.03.008

    Article  Google Scholar 

  42. Bagchi M, White PR (2005) The potential of public transport smart card data. Transp Policy 12:464–474. https://doi.org/10.1016/j.tranpol.2005.06.008

    Article  Google Scholar 

  43. Bagchi M, White P (2004) What role for smart-card data from bus systems? Proc ICE Munic Eng 157:39–46. https://doi.org/10.1680/muen.157.1.39.36598

    Article  Google Scholar 

  44. Hofmann M, O’Mahony M (2005) Transfer journey identification and analyses from electronic fare collection data. In: IEEE conf intell transp syst proceedings, ITSC, pp 825–830. https://doi.org/10.1109/ITSC.2005.1520156.

    Google Scholar 

  45. Seaborn C, Attanucci J, Wilson NHM (2009) Using smart card fare payment data to analyze multi-modal public transport journeys in London. Transp Res Rec 2121(1):55–62. https://doi.org/10.3141/2121-06

    Article  Google Scholar 

  46. Jang W (2010) Travel time and transfer analysis using transit smart card data. Transp Res Rec 2144:142–149. https://doi.org/10.3141/2144-16

    Article  Google Scholar 

  47. Devillaine F, Munizaga M, Trépanier M (2012) Detection of activities of public transport users by analyzing smart card data. Transp Res Rec 2276:48–55. https://doi.org/10.3141/2276-06

    Article  Google Scholar 

  48. Ma X, Wu YJ, Wang Y et al. (2013) Mining smart card data for transit riders’ travel patterns. Transp Res, Part C, Emerg Technol 36:1–12. https://doi.org/10.1016/j.trc.2013.07.010

    Article  Google Scholar 

  49. Jansen LN, Pedersen MB, Nielsen OA (2002) Minimizing passenger transfer times in public transport timetables. In: Proc seventh conf Hong Kong soc transp stud, pp 229–239

    Google Scholar 

  50. Guo Z, Wilson NHM (2011) Assessing the cost of transfer inconvenience in public transport systems: a case study of the London underground. Transp Res, Part A, Policy Pract 45:91–104. https://doi.org/10.1016/j.tra.2010.11.002

    Article  Google Scholar 

Download references

Acknowledgements

Not applicable.

Availability of data and materials

The data used in this article will not be shared because of the confidentiality agreement with the data provider.

Funding

National Natural Science Foundation of China (No. 61473320). Project of Innovation-driven Plan in Central South University (No. 2016CSX014). National Key Research and Development Program of China (No. 2016YFB1200400). Hunan Science and Technology Plan Project (No. 2015RS4011).

Author information

Authors and Affiliations

Authors

Contributions

ZZ, ZH, FZ analyzed the data. ZZ, PW wrote the manuscript. PW designed the research. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Pu Wang.

Ethics declarations

Competing interests

The authors declare that they have no competing interests.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic Supplementary Material

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Zheng, Z., Huang, Z., Zhang, F. et al. Understanding coupling dynamics of public transportation networks. EPJ Data Sci. 7, 23 (2018). https://doi.org/10.1140/epjds/s13688-018-0148-6

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1140/epjds/s13688-018-0148-6

Keywords