daisy(x, metric = "euclidean", stand = F, type = list())
In the daisy algorithm, missing values in a row of x are not included in the dissimilarities involving that row. If all variables are interval scaled, the metric is "euclidean", and ng is the number of columns in which neither row i and j have NAs, then the dissimilarity d(i,j) returned is sqrt(ncol(x)/ng) times the Euclidean distance between the two vectors of length ng shortened to exclude NAs. The rule is similar for the "manhattan" metric, except that the coefficient is ncol(x)/ng. If ng is zero, the dissimilarity is NA.
When some variables have a type other than interval scaled, the dissimilarity between two rows is the weighted sum of the contribution of each variable. The weight becomes zero when that variable is missing in either or both rows, or when the variable is asymmetric binary and both values are zero. In all other situations, the weight of the variable is 1. The contribution of nominal or binary variable a to the total dissimilarity is zero if both values are different, else it is equal to 1. The contribution of other variables is the absolute difference of both values, divided by the total range of that variable. Ordinal variables are first converted to ranks. If nok is the number of nonzero weights, the dissimilarity is multiplied by the factor 1/nok and thus ranges between 0 and 1. If nok is zero, the dissimilarity is NA.
daisy(x, metric = "manhattan", stand = T) # if all columns are interval scaled variables daisy(x, type = list(logratio = c(2,5)) # if columns 2 and 5 must be logaritmically transformed