Stochastic Optimization of IACC Yacht Performance
In recent years, the use of deterministic race modelling programs (RMP) has become more and more popular among racing yacht skippers to help with their route planning decisions. They are very common now in round the word races and long oceanic regattas. In coastal regattas, however, they are not that common.
In parallel, this kind of methodologies has started to be applied in the design phase of some high end projects such as America’s Cup yachts. The first use of these tools in an America’s Cup design was reported in 1987 by Stars & Stripes.
However, the planning of routes for sailing vessels is subject to uncertainty from the weather. This is particularly important in yacht racing where the accuracy of a weather prediction can determine the outcome of a race. With a perfect weather forecast it is possible to use the polar tables of a given yacht to compute a route that minimizes its arrival time at its destination.
If we focus on the design phase, the uncertainty of weather becomes a new variable of the problem, making the stochastic analysis of the routing problem a crucial point if we intend to optimize the design.
The traditional approach to the optimization of sailboats was based on the educated analysis of the polar curves of the vessel. Before the dawning of velocity prediction programs (VPP), this approach was even more limited in the sense that it involved the optimization of ship resistance and not of ship speed (i.e. the searching for an optimum form coefficient for a given velocity). The interaction between the ship resistance, the stability and the aerodynamic forces is neglected. A further improvement of this model came with the introduction of the VPP.
The VPP calculates the speed of the boat for each angle of sailing and for each wind speed. Now the designer is capable of considering the trade off between performance with low winds vs performance with high winds, or offwind vs upwind. The connection between the performance of the boat and the weather is still made by estimation of a weight function of the speed in different points of sailing; or, even, by a simple visual examination of the shape of the polar curves. The approach proposed in this paper involves a closer connection between the weather and the yacht performance.
The common approach of the RMP for design application usually uses the output from a VPP along with historical weather data to assess different yacht designs, by racing candidate designs over a range of weather scenarios.
This work presents a new approach to the stochastic analysis and numerical simulation for high performance yacht design. We consider the problem of finding the best ship, that which maximizes the probability to win a race under stochastic weather conditions.
It seems logical to treat the weather as a stochastic process. Thus, the conditions in which the boat sails can be expressed as a random variable. This random variable has an occurrence probability associated. Therefore, the best boat under that random sailing condition has a better chance of winning the regatta.
This paper is laid out as follows. First the statement of the problem is presented, in terms of the necessary discretización of the racing area for the numerical simulation of the process. Afterwards, the methodology for yacht performance and race modeling is shown. Then the route optimization algorithm developed to calculate best route to goal and the stochastic methodology used are presented.
Finally, as an example, the analysis of different alternative designs to one real IACC ship is included.
This section shows the numerical discretisation of the racing simulation and the resulting statement of the problem.
Let be Γ the 2D domain, representing the racing area and R an overlapping Cartesian partition of Γ in square elements with index i ranging from 1 to the number of elements n (see Figure 1).

(1) 
where

(2) 
and being

(3) 
Weather conditions are defined by a vector of environment factors W_{i}, linked to one element of the racing area Gi. The environment factors are assumed to be piecewise constant and therefore discontinuous between elements.
Figure 1. Discretización of the racing area. 
The route of the ship through the discrete racing area is represented by a chain of linear paths L = { L_{i }}.
Every path L_{i} is included in the element G_{i}, and therefore may be defined by an initial point x^{i}_{ini} and a course angle through the element.
Possible paths through every element can vary continuously and therefore an infinite number of possible routes are considered.
This chapter presents a brief overview of the yacht behaviour modelling, necessary in order to carry out the performance optimization exercise.
The speed of a yacht is determined by many factors, including wind strength, true wind angle, current, waves, and sail settings. These can be broken down into environmental factors, such as wind speed and direction, seastate, and current, and controllable factors, such as sail choice and trim. It is assumed that, for any given environmental conditions, the controllable factors will be set such that they maximize the speed of the yacht. Given this assumption, the speed at which a yacht sails is dependent upon the wind strength and the boat’s heading relative to the wind.
For the purpose of this work, the maximum speed at which a yacht can sail for a given true wind angle and true wind speed is obtained by a numerical velocity prediction program (VPP).
The VPP gives as one of its outputs a matrix of boat speed values as function of course angle in relation to the wind and the strength of this. The usual VPP would give values up angles in the realm of 30º to the wind, angles smaller than that are not usually sail at (either for a physical impossibility or for bad performance). These smaller angles are not an output of the usual VPP. To run our simulation we intended to make the boat sail as fast as possible to weather and off the wind. To accomplish that in a systematic way the typical polar plots of speed had to be modified.
The way the polar plots are modified uses the concept of Velocity Made Good (VMG). The VMG is the projection of the velocity vector in the direction of the wind. Thus, it gives the speed at which we would be travelling straight into the wind or off it. But, the boat does not travel straight into the wind, it is just a way of calculating the time that it would take it to get there without the need to simulate the tacking.
This approach might be improved by including the effect of the time lost in tracking in the above mentioned modification of the VPP. This fact was not considered in this work.
The boat, however, would not always travel above the minimum angle of a polar plot or straight into the wind but in a combination of both. Thus, we modified the curves so they always give the speed of travel in the direction we need to calculate as a function of the angle to the wind. When the angle to the wind is smaller than the angle at which the VMG is produced the speed is calculated as if the boat would make as many tacks as needed to travel in that direction as fast as possible. The same approach is used to modify the velocity if the boat is sailing deeper than its VMG angle off the wind. In this case the boat is assumed to gibe as needed.
After this process we obtain a matrix that gives the effective speed of the boat as a function of angle to the wind (continuous from 0 to 180 degrees) and the strength of it. The data on this matrix is interpolated using a cubic spline. Figure 2 shows a typical plot of a single polar before and after modification.
Figure 2. Polar plot of speed for 6 knots of wind modified and original. 
In this approach, the output from the VPP is used in conjunction with historical weather data in a race modelling program (RMP) designed specially for this work. The route from initial point to goal is represented by a union of linear paths through every element (see Figure 3).
For every element, the ship path may be determined by given an initial point x^{i}_{ini} and a course angle, since the course angle and the local weather condition determine the ship speed through the VPP. Once ship speed is determined, and being constant within a quadrant, it is trivial to follow the ship’s path to reach the limit of the element. Thus the required time is calculated.
In this section we consider the problem of sailing between two marks in the minimum time. The parameters that define the problem are the coordinates of the start and finish locations x_{ini} = (x_{ini}, y_{ini}) ∈ G_{ini}and x_{end} = (x_{end}, y_{end}) ∈ G_{end}, the Cartesian discretization R of the racing area, and the wind and water current distribution.
The scheme implemented to calculate the route that requires the minimum time t to be covered, is based in three steps:
The graph describing the racing area structure is built from the connectivity matrix of the elements G_{i}. From this information, it is possible to obtain all the possible routes through the racing area, from G_{ini} to G_{end}. This is carried out by using a standard recursive DepthFirst Search algorithm [7]. Result of this algorithm is a list of all the possible sets of G = { G_{i} } starting from G_{ini} to G_{end }and complaining with the connectivity of the mesh (see Figure 3)
Figure 3. Example of route G_{8}G_{9}G_{6}G_{3}G_{2} from graph structure 
The local optimization algorithm is based on a bruteforce rough global search. The search proceeds trough n different courses from the entrance point of the current element x^{i}_{ini} to the next element (see Figure 4). A refinement of the obtained solution based on a standard gradient descent is then carried out.
Figure 4. Global search for the different courses in the current element. 
Finally, the global optimization algorithm is an iterative algorithm based on perturbation of the current route and a sensibility minimization scheme based on gradient descent.
For the purpose of this work, sensibility s is defined as:

(4) 
Where α is the course angle, and L_{i} the current path.
The basic idea of the global optimization algorithm is to perform a loop on the elements G_{i} of the current set and evaluate the sensibility of the time required to arrive to the goal, from the current point.
If a negative sensibility is found for the current route, then an optimization of the course, based on gradient descent is done. The rest of the course to the finish location x_{end} is performed by successive application of the local optimization algorithm presented above.
The external loop on the set of elements is repeated until no more negative sensibilities are found.
Once the optimum route is calculated, we can evaluate the sensibility of the optimum route, defined by,

(5) 
A basic diagram of the presented algorithm is shown in Figure 5.
Figure 5. Basic diagram of the optimization algorithm. 
The algorithm discussed above has been programmed in C++ using an object oriented structure.
Note that the algorithm presented in this section can be executed sequentially to analyse regattas based on one or more downwind and upwind legs.
This section shows the stochastic statement of the problem. Result of the next analysis will be the definition of the best design as that most likely to win the race.
Let {W_{i}}_{i=1,n} be the set of random variables associated with the triple (α,ν,τ) of every element G_{i} of the racing area.

(6) 
where α is the wind angle, ν the wind speed and τ the water current intensity of one element G_{i}.
W_{i} are discrete random variables and therefore may take a discrete number and additionally finite range of values.

(7) 
At this point it is necessary to find the combined probability associated to the appearance of one weather condition in the racing area.
Let us notate as R = (W_{1},..., W_{n}) the random vector associated to the weather conditions of the racing area.

(8) 
Let us assume that the weather information of the cells has been obtained by statistical analysis of the variation of conditions for a long period of time. This way, the weather information of one element takes into account in a natural way the influence of the surrounding quadrants. Therefore, under these assumptions, it is possible to accept the stochastic independence of the random variables and then we can write:

(9) 
For the purpose of this work, we will assume that every yacht cover the race in the minimum possible time. Then, Let T_{i} be the random variable defining the time required by a ship in finishing the race.
(10)
Where t^{i}_{j} is the minimum of the times required for the ship in covering any of the infinite number of possible routes through the regatta field. For the calculation of this value, the optimisation algorithm presented in the previous section is used.
T_{i} may take any of m possible values. Being m the number of different weather conditions considered for the analyses.
Failed to parse (MathML with SVG or PNG fallback (recommended for modern browsers and accessibility tools): Invalid response ("Math extension cannot connect to Restbase.") from server "https://test.scipedia.com:8081/localhost/v1/":): {\textstyle m=\prod_{i=1}^nq_i}
Using above definitions, it is possible to assign to every ship design B_{i}one value of T_{i} for every weather condition of the racing area.
Being R one specific weather configuration of the racing field,

(11) 
we can define the onetoone correspondence between the weather condition of the racing field and the arrival time of the ship at its destination. And then we can write,

(12) 
From eq. it is possible to calculate the probability of a yacht to win a race. Let us define event M_{i} as follows,

(13) 
At this point, it is possible to introduce the associate events A and D_{i},,

(14) 
and therefore,

(15) 
But, since the probability of an event D_{j} is known and determined by the distribution of the regatta weather field condition, we can associate to every yacht the new random variable M_{i}. This variable, defined by the probability of a ship to win the race has a Bernoulli probability distribution of parameter λ [11],

(16) 
and therefore,

(17) 
It is noticeable that the yacht with more probability to win one race is also the best ship for a regatta consisting on several races.
The application example presented in this section consists of the simulation of different IACC designs in a race composed of an upwind leg and downwind leg.
The reliability of the approach is directly proportional to the quality of the weather data. If the random variable describes accurately the weather field, the resulting probability distribution would be accurate. This weather data may be hard to obtain because a special discretization of the statistical data is needed. Climatological data is available in form of pilot charts for an oceanic race. This is not usually the case for a regatta in sheltered waters. In our case, we processed data from the last America’s Cup venue, New Zealand. The weather data was obtained of the analysis of available historical data of several weather stations. Since currents data were not available, their effect will be neglected for this analysis.
The fleet analysed consist on several IACC designs, including the ship geometry of the Spanish America’s Cup boat Rioja de España, participant in the races of 1995. The geometry of the boat and some towing test data were published in [5].
For this application, the racing field was discreticed in a 3x3 elements, resulting in the analysis of more than 19000 weather conditions.
The importance of the application of a stochastic approach is exemplified in Figure 6 and Figure 7. These figures represent the probability of winning and the number of victories that a boat would have if it competed versus all the others. The key point here is that if we were to look just at the number of victories as our criteria to choose the best boat we would be inclined to select yacht C. In reality, the highest possibility of winning is that of yacht E.
Figure 6. Victory Probability. Upwind Leg. 
The explanation for this becomes obvious once we consider that the description of the race field that we have made. Since this description is stochastic the weather is a random variable with different probabilities of occurrence. Thus, it is possible that a boat that wins many regattas but in the less likely conditions would have a worse chance at winning that one that wins fewer regattas but wins the ones that are most likely to occur.
Figure 7. Number of Victories. Upwind Leg. 
It is for those reasons that we consider that the stochastic connection between the characteristics of the boat and its performance around a given course gives a better definition of the best boat.
The methodology presented is applicable across every stage in the design process.
Firstly, in the preliminary design phase, it may serve as a tool to discriminate certain parts of the parametric design space. We would be looking for a zone in that space that maximizes the probability of winning. This is not as straight forward as it may seem. The set of test cases has to cover the design space adequately. Thus, if we have a set of cases to study just certain variables, the variables that are left out should be kept as constant as possible. In Figure 8, we can observe a surface formed by the probability of winning of eight different designs. The surface is elevated over a plane defined by the values of prismatic coefficient and L/B ratio.
Figure 8. Probability in a Cp(L/B) plane 
In a more qualitative sense, any trade off usually identified in ship design can be analyzed from the perspective of this method. It is possible for example to study the probabilistic behaviour of the boat on the windward leg compared to the leeward leg. In other words, it is possible to study the influence of performance offthewind versus performance intothewind in the total probability. (see figures 9, 10 and 11)
Figure 9. Victory Probability. Upwind Leg. 
Figure 10. Victory Probability. Downwind Leg. 
Figure 11. Victory Probability Upwind Leg + Downwind Leg. 
The way the analysis is carried out; there is another piece of information that may result useful. This refers to course sensibility. This sensibility can be thought of as a measure of the boat’s ability to stay in a narrow band of times for a course other than the optimum (figure of sensibilities). Obviously, a boat with a smaller sensibility is a better boat (probability of winning being the same). Thus, it is possible to qualitatively account for strategic mistakes or other perturbations of the course. There are two different uses of this data.
Figure 12. Number of Victories. Upwind and Downwind with Perturbation for Yacht E. 
For one, we can perturbate the route of all boats under consideration by the same amount and check what the effect is on the general probability distribution. It may be the case that a boat that had a high probability before is too coursedependent and would have to be sailed perfectly in the tactical sense.
It is also possible to vary the course of just one boat keeping the others constant. This way, we could see how the boat handles courses other than the optimum when sailed against boats that follow that perfect route (see Figure 13).
Figure 13. Victory Probability. Upwind and Downwind with Perturbation for Yacht E. 
Later on the design cycle, once we have a couple of candidates, the simulation may be used to study the probability of victory of one design over the other. It is also possible to consider variations in configuration so that we could cover strong spots of the opponent. This would be reflected as a higher probability of winning and would give a measure of up to what point to give up big advantages in certain conditions to gain in others.
Figure 14. Victory Probability. Upwind and Downwind Yacht C VS Yacht E. 
As an example, we simulate a matchrace between yacht C and yacht E that seemed the most promising (see Figure 14). After this, while the probability of winning of yacht E stays practically the same, the probability of yacht C increasing notably. Therefore, yacht C losses lots of races to the other design but pretty much a fixed number to design E.
Finally, once we know our opponent it could be feasible to study our probability of beating them. This could be study from the perspective of the statistical whether data or a more concrete description of the expected weather. A good tool to use in this sense is an ensemble weather forecast. This would give us a probabilistic prediction of the wind field based on the probabilities of different weather scenarios. We could then simulate the race based on this new and more concrete stochastic data.
The purpose of this paper was to present a different approach to the design of a high performance racing sailboat. It may be somehow simplistic in certain aspects, but none the less we believe that it gives a good perspective of what can be done. On of the possible improvements would be to simulate the effects of tacking and currents. Another possible direction of work would be to introduce the effects of another boat or a fleet of them in the route chosen to sail the race course.
The weakest link in this kind of analysis though resides in the weather data. An accurate stochastic description of the race field is needed to get an accurate probabilistic response of the boats. Recent improvements in weather analysis methods and the wealth of new data continuously made available from weather analysts looks poised to solve this problem.
The authors want to thank Mr. Manuel López for their technical assistance in the preparation of this work.