 regular article
 Open Access
 Published:
Debye–Hückel theory for refugees’ migration
EPJ Data Science volume 7, Article number: 22 (2018)
Abstract
In this paper, we follow the shortranged Syrian refugees’ migration to Lebanon as documented by the UNHCR. We propose a model inspired by the Debye–Hückel theory and show that it properly predicts the refugees’ mobility while the gravity model fails. We claim that the interaction between origin cities attenuates and/or extenuates the flux to destinations, and consequently, in analogy with the effective charges of interacting particles in a plasma, these source cities are characterized by effective populations determined by their pairwise remoteness/closeness and defined by areas of control between the fighting parties.
There is a longstanding history of the study of human mobility patterns, with reliance on census data, which spans the works of Ravestein who focused on migration within the UK [1], Zipf’s intercity \(P_{1}P_{2}/D\) movement [2], the intervening opportunities model [3, 4], urban travel demands and regional modeling [5, 6], all of which contributed to the understanding of economic processes [7, 8], urban planning, traffic engineering [9, 10], and the spreading of infectious diseases [11–14]. At present, the ubiquity of mobile phone usage data and credit card transactions made human mobility more amenable to mathematical analysis, and therefore, lead to the discovery of underlying patterns of motion described as random walks and particularly Lévy flights [15–17]. It also revealed universal behavior, which was explained by the gravity and the radiation models [18–24], which also describes trade flow [7, 25] among other types of traffic. The first links the flux between origin and destination cities to powers of their initial populations with an inverse dependence on their pairwise distance given by:
where \(T_{ij}\) is the influx between the source city i and the destination city j, \(p_{i}\) and \(p_{j}\) are their respective initial populations, \(d_{ij}\) is their pairwise distance, α, β, and γ are the model parameters. On the other hand the radiation model is a parameterfree model given by:
where \(T_{i} = \sum_{i \neq j} T_{ij}\) the total number of commuters at location i, and \(s_{ij}\) is the population density in a circle centered at i.
These two competing models, with variants [26], have been shown to properly predict human mobility and their relative likelihood has been explored [23, 27]. However, their validity to model mobility under stressful conditions beyond the daily commute or travel has not been examined. Moreover, there is a clear evidence of the breakdown of models based on stationary events in explaining behavioral changes in human activity under severe conditions [28–34].
In addition, the understanding of human flows under external perturbations and abrupt changes such as panic, stampedes and motion beyond the stationarity of daily activity, is pivotal to emergency response and crisis management and thus the need for models which capture these dynamics is unequivocal. Of particular interest is the refugee migration, which is a form of human mobility under life threatening circumstances. Unlike the long timescale decision underlying the job selection process according to which the origin and destination are determined, refugees’ migration is a pressing decision made on a very short timescale to flee a endangering environment. In this paper we are specifically concerned with the displaced Syrian population to neighboring countries and particularly to Lebanon, a highly vulnerable country to the crisis spillover because of the large refugees’ influx straining the infrastructure and further destabilizing Lebanon’s socioeconomic conditions [35–37]. The unavailability of the full migration data out of Syria restricted our data exploration and model fitting to a modification of the gravity model and made it impossible to rule out the explanatory potential of the parameterfree radiation model. We thus present an adjusted gravity model incorporating the peculiarities of refugees’ mobility using the United Nations Human Rights Council (UNHCR) data made available to our lab through an official data sharing request documenting their influx up till 2014 when the Lebanese–Syrian borders were closed [36]. The data chronicles the cumulative refugees’ count until May 2014 detailing of their origins and destinations.
Theory
In econometrics, correction had been introduced to the gravity model as it failed to take into consideration the multilateral resistance terms in trade. The equation was either augmented with an exponential term accounting for the importer exporter difference, known as the Anderson–van Wincoop Gravity equation or with a stochastic effect [7]. Physically this is equivalent to shifting from an isolated cities paradigm to a description that takes into account the cities interactions. Analogously, Coulomb’s law, a special case of Eq. (1) for \(\alpha, \beta= 1\) and \(\gamma= 2\), which describes the interaction between two isolated particles with charges \(p_{i}\) and \(p_{j}\) is invalidated when the particles are in an electrolyte or a plasma [38]. More precisely, the surrounding particles screen the potential and subsequently their effect is equivalent to the renormalization of the bare charge from \(p_{i} \rightarrow p_{i} e^{r/l}\), where l is the Debye Hückel length.
Based on the above, two basic contentions underly our paper: when medium to long range mobility between non isolated cities is studied, the interaction between the source cities cannot be neglected and this is the reason why we suspect that the gravity model fails to explain the migration data. Thus, we argue that the latter is the ideal limit behavior and, therefore, accounting for the presence of multiple nearby source cities and their effect in shielding or accentuating the flux, allows to properly predict the migration as is the case of an electron in a plasma where the generated shielding depends on the configuration of the surrounding charged particle [39]. Our second argument is based on the symmetry breaking nature of the refugees’ migration. The flux becomes almost unidirectional as people from the host cities are unlikely to leave to unsafe destinations, that is: \(T_{ij} \gg T_{ji}\) even when their populations \(p_{i}\) and \(p_{j}\) are comparable, where i and j are Syrian and Lebanese cities respectively. For this we rewrite the gravity model with the renormalized effect:
where \(d_{ik}\) is the distance between the origin city i and the rest of the origins k and \(\delta_{k}\) is a model parameter. Each origin city is tagged with a corresponding \(\delta_{k}\), which shields the flux when it is negative and accentuates it for positive values of the parameter. Therefore, \(\delta_{k}\) can be thought of as a characteristic length marking the influence of the origin city k on the migration from i, while the sign of \(\delta_{k}\) determines its effect on the increase/decrease of the migration out of i. More precisely, when \(d_{ik}/\delta_{k}  \ll1\), the exponential term barely changes the effective population of i. On the contrary, when \(d_{ik}/\delta_{k}  \gg1\) the migration out of i is highly affected by city k either positively or negatively depending on the sign of \(\delta_{k}\).
Moreover, Eq. (3) incorporates the highly nonsymmetrical nature of the flux. Explicitly, when i denotes a Lebanese city and j is a Syrian destination the exponential term introduces the possibility of obstructing the flow in this direction for negative values of \(\delta_{k}\).
Finally, it should be noted than unlike the Debye–Hückel model where a single lengthscale characterizes the system’s particles, in our model each city is characterized by its own \(\delta_{k}\) and consequently the analogy with the DH model is incomplete.
Methods and results
Figure 1 shows the chord diagram depicting the flow from Syria to Lebanon with the width of the chords characterizing the flux intensity. The diagram reveals preferential migration from certain origin cities to specific destinations, which is explained by our model as we will show in what follows. The diagram was produced using the Migration data.xlsx.
In order to test our model and compare it with both the radiation and the gravity models we used Google API to calculate the cities’ pairwise distance, which is the shortest path over the road network; the Distance.csv file includes the Lebanese to Syrian cities’ distances while SyriaSyriaDistance.csv is the inter Syrian cities distance matrix. Additionally, in the face of both Lebanon and Syria’s lack of census data we retrieved their estimates from the City population website [40]. Moreover, we gathered the population density from the Gridded Population of the World, Version 4 (GPWv4) as a raster file with 1 km resolution [41]. The latter was then intersected with Lebanon, Syria, Jordan, Turkey and Iraq’s shapefiles to produce the population density raster for this region of interest. Subsequently, \(s_{ij}\) is assigned the mean value of the population density inside the circle centered at Syrian origin city i with radius \(d_{ij}\) where the latter is the distance from i to a Lebanese city j; the results are tabulated in DistanceMatrixSij.csv. We then performed a regression on the data of \(T_{ij}\) predicted by the radiation model against the real refugee fluxes. The model’s Multiple Rsquared is 0.0152 with an Adjusted Rsquared of 0.01235. However, we still cannot not rule out the radiation model since \(T_{i}\), which is the total number of refugees leaving location i, could not be calculated using this UNHCR data, which only provides the total number of refugees fleeing to Lebanon and excludes the ones fleeing to other neighboring countries. Nevertheless, we suspect that the asymmetry in the flux can no longer be explained by the radiation model since its derivation relied upon integrating over a single benefits distribution function \(p(z)\), which should be modified to incorporate the effect of war. Similarly we performed a regression against the data for the gravity model whose Multiple Rsquared is given by 0.4533 with an the Adjusted Rsquared: 0.4486.
Finally, our model’s regression results are given by a Multiple Rsquared of 0.8082 and an Adjusted Rsquared of 0.8, which clearly outperforms both models. This was achieved through 10,000 successive sampling in each iteration of which the model is trained over 80% of the data and tested over the rest and subsequently the one with minimal test error was chosen. Further, we performed an exhaustive test for model selection using both backward and forward stepwise regressions and subsequently our variables selection was made according to the Akaike Information Criterion (AIC); its minimum is given by 32.098, which corresponds to including all the variables in Eq. (3) and thus the removal of variables from our model does not improve the predictions.
Moreover, it should be noted that the variables \(d_{ij}\) are inherently interdependent since they mark intercities distances; that is \(d_{ik}\) is related to a combination of \(d_{ij}\) and \(d_{kj}\). In the case of predictors’ multicollinearity their coefficient’s variances can be highly sensitive to changes in the model as is the case when more migration data at a finer spatial resolution is added, which increases the number of origin cities k. The stability in the coefficients’ variances can be guaranteed when their high Variance Inflation Factor (VIF), which is signature of the multicollinearity, is countered by high regressor’s variability. More precisely, the efficiency of extrapolating the fitted model to newly acquired data is not affected by the presence of multicollinearity as long as the new data and the one on which the regression model is built share the same multicollinearity pattern [42], which is the case here since the data of the potential new cities include equally multicollinear distances variables. Equivalently, the scope of the model is restricted to the range of predictors that exhibit the same multicollinearity pattern.
Figure 2 shows the UNHCR data together with the red curve fitting the logarithm of Eq. (3) given by \(\log{T_{ij}}\) as a function of the \(\log{A} + \alpha\log{p_{i}} + \beta \log{p_{j}}  \gamma\log{d_{ij}} + \sum_{k \in\mathcal{O}} d_{ik}/ \delta_{k}\). Table 1 summarizes the values of \(\delta_{k}\) while Fig. 3 shows the map of Syria with circles centered at the source cities’ with radii proportional to \(\delta_{k}\) and color coded by the sign of their corresponding \(\delta_{k}\).
Discussion
The highly populated provinces in Syria, Aleppo, Homs, Idleb, Rural Damascus, and Hama contributed the most to the influx to Lebanon as shown in Fig. 1. Those also witnessed the major combats. In order to understand these patterns some socioeconomic background will be drawn [43, 44]. Before the war, Syrians sought job opportunities in Lebanon, and particularly the rural population of Idleb, which then hosted relatives and contacts fleeing the war. Consequently, this explains the scatter of the Idleb refugees across the Lebanese territories. This migration is mainly linked to a network of workers who chose their destinations based on the market’s demand, and subsequently after the war their families followed them to seek refuge.
Refugees from Homs are concentrated in the nearby cities of Akkar, Tripoli, and Koura all of which are in the neighboring Northern Lebanon and share a common prevalent sunni denomination, while those migrating from Aleppo are settled in Zahleh and in Beirut mainly. Here a distinction should be made between the rural and the urban populations of the province. The Syrian–Armenian refugee community coming from Aleppo settle in Burj Hammoud in Beirut, which is dominantly LebaneseArmenian, and is drawn to the area through family ties and shared craftsmentship while city dwellers and businessmen head to downtown Beirut. Conversely, those coming from the rural areas in Aleppo settle in the Bekaa. Additionally, the influx from Soueida to Chouf reveals the family ties connecting the Druze community across borders [45].
Despite these peculiarities and the different socioeconomic factors driving the migration, the results of our model revealed the existence of two types of origin cities depending on the sign of their corresponding \(\delta_{k}\). We suspect that this is the result of the interplay between areas controlled by the central government and areas controlled by its opponents, and is thus equivalent to an internally drawn border constraining migration and defining its routes. To check this hypothesis we compared our findings with reports delineating areas of control and show that our findings are strikingly in agreement with these maps [45]. In the latter, cities under central government control are shown in light purple and these correspond to the cities with positive \(\delta_{k}\) in our model, which we denote by \(\mathcal{R}\), while the others held by all the regime’s opponents correspond to cities with negative \(\delta_{k}\) denoted by \(\mathcal{RO}\). Particularly, when people are far from a region \(\mathcal{R}\) under government control they are more likely to migrate while the opposite effect occurs when they are far from \(\mathcal{RO}\). More precisely, when \(d_{ik}/{\delta_{k}} \gg0\), that is when the origin city i is very far from k, where the latter is under government control the migration out of i intensifies. Conversely, if a city i is distant from k where the latter is under \(\mathcal{RO}\) the likelihood of migration decreases. Thus, the cities with positive \(\delta_{k}\) increase the migration out of the source cities, while those with negative \(\delta_{k}\) have the contrary effect. Therefore, our parameter \(\delta _{k}\) is a proxy to understanding how the local population perceives the safety and security of their origin cities in relation to their remoteness from \(\mathcal{R}\) and \(\mathcal{RO}\).
Conclusively, the Debye–Hückel theory proved to have correspondences with the sociology of human migration under duress. In analogy with other models borrowed from physics, the“microscopic details” of the system under scrutiny turned out of be irrelevant; that is of little effect on the macroscopic process [46, 47]. In the case of migration this entails that the individual level variability of the refugees, their socioencomonic backgrounds and the geopolitics of war seem to be averaged out and aggregated into an exponential term. The model suggests this is achieved through the refugee’s scanning of space and moving towards the perceived safe destination.
Conclusion
In this paper, we have presented a model for refugees’ migration, which is based on the idea of interaction between source cities. Effectively, this resulted in a renormalization in the source cities’ population as our model suggested. Consequently, cities were classified according to the sign of their corresponding \(\delta_{k}\). This sign difference was linked to the interplay between different areas of control leading to a space dependent migration undergoing varying degrees of friction. Our model thus represents an attempt to predicting human mobility in relation to space and its fragmentation between the fighting parties. The analysis of these patterns should also complemented with migration data to other neighboring countries, which we did not have access to.
Abbreviations
 \(\mathcal{R}\) :

Regime controlled cities
 \(\mathcal{RO}\) :

Regime opponents’ controlled cities
References
Ravenstein EG (1885) The laws of migration. J Stat Soc Lond 48(2):167–235
Zipf GK (1946) The \(\frac{P_{1} P_{2}}{ D}\) hypothesis: on the intercity movement of persons. Am Sociol Rev 11(6):677–686
Stouffer SA (1960) Intervening opportunities and competing migrants. J Reg Sci 2(1):1–26
Stouffer SA (1940) Intervening opportunities: a theory relating mobility and distance. Am Sociol Rev 5(6):845–867
Domenicich T, McFadden D (1975) Urban travel demand: a behavioral analysis. NorthHolland, Amsterdam
Wilson AG (1970) Entropy in urban and regional modelling. Pion, London
Anderson JE, van Wincoop E (2003) Gravity with gravitas: a solution to the border puzzle. Am Econ Rev 93(1):170–192
Rouwendal J, Nijkamp P (2004) Living in two worlds: a review of hometowork decisions. Growth Change 35(3):287–303
de Dios Ortúozar J, Willumsen LG (2011) Modelling transport. Wiley, Chichester
Helbing D (2001) Traffic and related selfdriven manyparticle systems. Rev Mod Phys 73(4):1067–1141
Balcan D, Colizza V, Gonçalves B, Hu H, Ramasco JJ, Vespignani A (2009) Multiscale mobility networks and the spatial spreading of infectious diseases. Proc Natl Acad Sci USA 106(51):21484–21489
Eubank S, Guclu H, Kumar VA, Marathe MV, Srinivasan A, Toroczkai Z, Wang N (2004) Modelling disease outbreaks in realistic urban social networks. Nature 429(6988):180
Hufnagel L, Brockmann D, Geisel T (2004) Forecast and control of epidemics in a globalized world. Proc Natl Acad Sci USA 101(42):15124–15129
Truscott J, Ferguson NM (2012) Evaluating the adequacy of gravity models as a description of human mobility for epidemic modelling. PLoS Comput Biol 8(10):e1002699
Çolak S, Alexander LP, Alvim BG, Mehndiratta SR, González MC (2015) Analyzing cell phone location data for urban travel. Transp Res Rec 2526:126–135
Brockmann D, Hufnagel L, Geisel T (2006) The scaling laws of human travel. Nature 439(7075):462–465
Schneider CM, Belik V, Couronné T, Smoreda Z, González MC (2013) Unravelling daily human mobility motifs. J R Soc Interface 10(84):20130246
Masucci AP, Serras J, Johansson A, Batty M (2013) Gravity versus radiation models: on the importance of scale and heterogeneity in commuting flows. Phys Rev E 88(2):022812
Yan XY, Zhao C, Fan Y, Di Z, Wang WX (2014) Universal predictability of mobility patterns in cities. J R Soc Interface 11(100):20140834
Wang XW, Han XP, Wang BH (2014) Correlations and scaling laws in human mobility. PLoS ONE 9(1):e84954
Song C, Qu Z, Blumm N, Barabási AL (2010) Limits of predictability in human mobility. Science 327(5968):1018–1021
González MC, Hidalgo CA, Barabási AL (2008) Understanding individual human mobility patterns. Nature 453(7196):779–782
Simini F, González MC, Maritan A, Barabási AL (2012) A universal model for mobility and migration patterns. Nature 484(7392):96–100
Hasan S, Schneider CM, Ukkusuri SV, González MC (2013) Spatiotemporal patterns of urban human mobility. J Stat Phys 151(1):304–318
Silva JS, Tenreyro S (2006) The log of gravity. Rev Econ Stat 88(4):641–658
Lenormand M, Bassolas A, Ramasco JJ (2016) Systematic comparison of trip distribution laws and models. J Transp Geogr 51:158–169
BarbosaFilho H, Barthelemy M, Ghoshal G, James CR, Lenormand M, Louail T, Menezes R, Ramasco JJ, Simini F, Tomasini M (2017) Human mobility: models and applications. arXiv:1710.00004
Bagrow JP, Wang D, Barabasi AL (2011) Collective response of human populations to largescale emergencies. PLoS ONE 6(3):e17680
Lu X, Bengtsson L, Holme P (2012) Predictability of population displacement after the 2010 Haiti earthquake. Proc Natl Acad Sci USA 109(29):11576–11581
Kapoor A, Eagle N, Horvitz E (2010) People, quakes, and communications: inferences from call dynamics about a seismic event and its influences on a population. In: AAAI spring symposium: artificial intelligence for development
Helbing D, Farkas I, Vicsek T (2000) Simulating dynamical features of escape panic. Nature 407(6803):487
Kaplan EH, Craft DL, Wein LM (2002) Emergency response to a smallpox attack: the case for mass vaccination. Proc Natl Acad Sci USA 99(16):10935–10940
Vespignani A (2009) Predicting the behavior of technosocial systems. Science 325(5939):425–428
Helbing D, Farkas IJ, Molnar P, Vicsek T (2002) Simulation of pedestrian crowds in normal and evacuation situations. Pedestr Evac Dyn 21(2):21–58
Government of Lebanon, The United Nations, (2017) Lebanon crisis response plan, pp 1–186
Cherri Z, Arcos González P, Castro Delgado R (2016) The Lebanese–Syrian crisis: impact of influx of Syrian refugees to an already weak state. Risk Manag Healthc Policy 9:165–172
Salem P (2012) Can Lebanon survive the Syrian crisis?. Carnegie Middle East Center, Carnegie Endowment for International Peace, Beirut
Pitzer KS (1973) Thermodynamics of electrolytes. I. Theoretical basis and general equations. J Phys Chem 77(2):268–277
McComb WD (2004) Renormalization methods: a guide for beginners. Oxford University Press, New York
Population estimates. https://www.citypopulation.de/
Center for International Earth Science Information Network—CIESIN, Columbia University (2017) Gridded population of the world, version 4 (GPWv4): population density, revision 10
Gujarati DN (2009) Basic econometrics. McGrawHill, Irwin.
Balanche F (2005) Syrie–Liban: intégration régionale ou dilution. Mappemonde 3
Balanche F (2011) Géographie de la révolte Syrienne. Outreterre 3:437–458
Balanche F Ethnic cleansing threatens Syria’s unity. Washington Institute for Near East Policy, Washington, DC. http://www.washingtoninstitute.org/policyanalysis/view/ethniccleansingthreatenssyriasunity
Galam S (2012) Sociophysics: a physicist’s modeling of psychopolitical phenomena. Springer, New York
Galam S (2013) When humans interact like atoms. Underst Group Behav 1:293–312
Acknowledgements
We acknowledge the UNHCR’s role in providing the data to our Center and are thankful to Fabrice Balanche and Stefano Monni for the valuable discussions. We are grateful to the late Albert Naccache for the thorough reading of the manuscript.
Availability of data and materials
The data is available to download as this paper’s supporting material.
Funding
The authors received to funding to carry out this work.
Author information
Authors and Affiliations
Contributions
SN conceived the theory, carried out the computations, GF analyzed the results, and they both wrote the manuscript. All authors read and approved the final manuscript.
Corresponding author
Ethics declarations
Competing interests
The authors declare that they have no competing interests.
Additional information
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Electronic Supplementary Material
Below are the links to the electronic supplementary material.
13688_2018_154_MOESM1_ESM.xlsx
Migration data. The Migration Data.xlsx is an excel file whose first row contains the name of the Syrian origin cities while the first column contains the Lebanese destination cities. The entries refer to the number of refugees fleeing from the corresponding origin city to the destination. (XLSX 8 kB)
13688_2018_154_MOESM2_ESM.csv
Distance. The Distance.csv contains the pairwise distance matrix between Origin and destination cities retrieved through the google API. (CSV 3 kB)
13688_2018_154_MOESM3_ESM.csv
Syria–Syriadistance. The SyriaSyriaDistance.csv contains the pairwise distance matrix between Syrian cities retrieved through the google API. (CSV 2 kB)
13688_2018_154_MOESM4_ESM.csv
Density. The DensityMatrixSij.csv contains the population density between every Syrian city i and Lebanese city j. (CSV 7 kB)
Rights and permissions
Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.
About this article
Cite this article
Najem, S., Faour, G. Debye–Hückel theory for refugees’ migration. EPJ Data Sci. 7, 22 (2018). https://doi.org/10.1140/epjds/s1368801801548
Received:
Accepted:
Published:
DOI: https://doi.org/10.1140/epjds/s1368801801548
Keywords
 Gravity model
 Mobility
 Migration