If there exists two disjoint convex sets C,D then there exists a hyperplane a⊤x=b that seperates the two sets.
Define the two points c∈C and d∈D that are closest to each other, and let it be the distance between the two sets:
dist(C,D)=inf{∣∣c−d∣∣2∣c∈C,d∈D} The inf function denotes the infimum ⇒ the largest lower bound of ∣∣c−d∣∣2
Then there exists a seperating function:
f(x)=(d−c)⊤(x−2d+c)=0 And we will see that
∀c∈C,f(c)≤0∀d∈D,f(d)≤0 We will prove those constraint by contradiction:
Assue ∃u∈D such that f(u)<0
f(u)=(d−c)⊤(u−2d+c)=(d−c)⊤((u−d)+21(d−c))=<d−c,u−d>+we know that this is ≥021∣∣d−c∣∣22 So implies that for f(u)<0, <d−c,u−d> has to be lower than 0
%20bf04e6495e4e49659ed4cf6b446d8ed7/Untitled.png)
Because this inner product is negative, so if I move along the vector u−d for some distance t, then I may get closer to c
Let
p=d+t(u−d)=tu+(1−t)d If t∈(0,1) then p∈D (Since D is convex)
Consider
∣∣c−p∣∣22=∣∣c−d−t(u−d)∣∣2=((d−d)−t(u−d))⊤((c−d)−t(u−d))=∣∣c−d∣∣2+t2∣∣u−d∣∣2−2t<c−d,u−d> We want to show that p is closer than d to c
So let’s write the next two terms
=t2∣∣u−d∣∣2−2t<c−d,u−d>t2∣∣u−d∣∣2+2tstrictly <0<d−c,u−d> But we can always choose a value of t such that the first term is always smaller in magnitude
t<−∣∣u−d∣∣22<d−c,u−d> So we have proved a contradiction (because now the distance between c and d is no longer the minimal)