Archive

Posts Tagged ‘Research’

An Introduction to Schmidt Norms

October 2nd, 2009

In [1], a family of matrix norms (called Schmidt norms) are studied and some of their uses in quantum information theory are explored. The interested reader is of course welcome to read the results presented in that paper, but for the more casual reader I present here one very crucial preliminary, the Schmidt decomposition theorem, and a proof that the Schmidt norms actually are (as their name suggests) norms.

Schmidt Decomposition Theorem

The Schmidt decomposition theorem says that any complex vector vCnCn can be written as

{\bf v}=\sum_{j=1}^k\alpha_j{\bf e_j}\otimes{\bf f_j}

where k ≤ n, {αj} ⊆ R is a family of non-negative real scalars, and {ej}, {fj} ⊆ Cn are two orthonormal sets of vectors. I won’t prove the theorem here — a proof can be found on its Wikipedia page (it’s basically the singular value decomposition in disguise). For our purposes the most important thing to realize is that, for some vectors v, we can write v in its Schmidt decomposition with k < n. The least k such that v can be written in the form above is called the Schmidt rank of v, and we denote it by SR(v). Every vector v has SR(v) ≤ n.

Schmidt Matrix Norms

The Schmidt k-norm of a matrix X ∈ Mn is defined to be

\big\|X\big\|_{S(k)}:=\sup_{{\bf v},{\bf w}}\big\{|{\bf w}^*X{\bf v}| : \|{\bf v}\|,\|{\bf w}\|\leq 1,SR({\bf v}),SR({\bf w})\leq k\big\}

That might look like a horribly complex definition upon first glance, but it’s not so hard to get your head around when you realize that the Schmidt k-norm for k = n is simply the standard operator norm of X. It is clear then that the Schmidt k-norm for k < n must be a smaller quantity. Indeed, from a quantum information perspective, the norm measures how much the operator represented by X can stretch pure states that “aren’t very entangled.” The interested reader can learn about the various properties and applications of these norms in [1] — what I present here is simply a proof that the Schmidt k-norm is indeed a norm (since this is not explicitly done in the paper).

Proof that the Schmidt k-norm is a norm. It is clear from the definition that the absolute value of a constant pulls out of the Schmidt norms and that the Schmidt norms satisfy the triangle inequality. The only challenging property of the norm to verify is that the Schmidt norm of X being zero implies X = 0.

To prove this, assume that we are in the k = 1 case (if we can show that this property holds for k = 1, it immediately follows that the same property must hold for k > 1). Then recall that we can write X as the sum of elementary tensors, so we can write

X=\sum_jA_j\otimes B_j,\ \ {\bf v}={\bf v_1}\otimes{\bf v_2},\text{ and } \ {\bf w}={\bf w_1}\otimes{\bf w_2}.Furthermore, we may write X in this way using matrices Bj that are linearly independent (see, for example, Proposition 24 of [2], or simply note that you could choose them to be a family of matrix units). Thus, if the Schmidt 1-norm of X equals zero, then it follows that for any v1, v2, w1, and w2:

{\bf w_2}^*\Big(\sum_jc_jB_j\Big){\bf v_2}=0 \ \text{ where }c_j={\bf w_1}^*A_j{\bf v_1} \ \ \forall \, j.

Since this holds for any v2 and w2, it follows that

\sum_jc_jB_j=0.

Because we chose the Bj matrices to be linearly independent, it follows that cj = 0 for all j. By referring back to the definition of cj, we see that this then implies Aj = 0 for all j, so X = 0 as desired. QED.

References:

  1. N. Johnston and D. W. Kribs, A family of norms with applications in quantum information theory. Journal of Mathematical Physics 51, 082202 (2010). arXiv:0909.3907 [quant-ph]
  2. Johnston, N., Kribs, D. W., and Paulsen, V., Computing stabilized norms for quantum operations. Quantum Information & Computation 9 1 & 2, 16-35 (2009). arXiv:0711.3636v1 [quant-ph]

Quantum Semidefinite Programs

September 25th, 2009

In quantum information theory, semidefinite programs are often useful, as one is often interested in the behaviour of linear maps over convex sets. For example, they have very recently been used to compute the completely bounded norm of a linear map [1], prove that QIP = PSPACE [2], and bound a new family of norms of operators [3]. However, if you were to look at the standard form of a semidefinite program provided on the Wikipedia page linked above, you would likely only see some very superficial connections with the standard form of quantum semidefinite programs in references [1-3] — this post aims to bridge that gap and show that the two forms are indeed equivalent (or at the very least outline the key steps in proving they are equivalent).

The Quantum Form

Let Mn denote the space of n×n complex matrices. Assume that we are given Hermitian matrices A = A* ∈ Mn and B = B* ∈ Mm, as well as a Hermicity-preserving linear map Φ : Mn → Mm (i.e., a map such that Φ(X) is Hermitian whenever X is Hermitian). Then we can define a quantum semidefinite program to be the following pair of optimization problems:

Quantum Semidefinite Program

In the dual problem, Φ refers to the dual map of Phi — that is, the adjoint map in the sense of the Hilbert-Schmidt inner product. It is not surprising that many problems in quantum information theory can be formulated as an optimization problem of this type — completely positive maps (a special class of Hermicity-preserving maps) model quantum channels, positive semidefinite matrices represent quantum states, and the trace of a product of two positive semidefinite matrices represents an expectation value.

The Standard Form

In the more conventional set up of semidefinite programming, we are given matrices D and {G_i} ∈ Mr and a complex vector c ∈ Cs. The associated semidefinite program is given by the following pair of optimization problems:

Semidefinite Programming Standard Form

The interested reader should read on Wikipedia about how semidefinite programs generalize linear programs and how their theory of duality works. It is also important to note that semidefinite programs can be solved efficiently to any desired accuracy by a variety of different solvers, using a number of different algorithms. Thus, once we show that quantum semidefinite programs can be put into this standard form, we will be able to efficiently solve quantum semidefinite programs.

Converting the Quantum Form to the Standard Form

Define a linear map Ψ : Mn → (Mm ⊕ Mn) by

Psi

Then the requirement that $\Phi(P) \leq B$ and $P \geq 0$ is equivalent to
\[
\Psi(X) \leq \begin{bmatrix}B & 0 \\ 0 & 0 \end{bmatrix}.

Then the requirement that Ψ(P) ≤ B and P ≥ 0 is equivalent to

Psi Inequality

The dual map Ψ is given by

Psi Dual

By putting these last few steps together, we see that our original quantum semidefinite program is of the following form:

Simplified Quantum SDP

The inequality in the dual problem was able to be replaced by equality because of the flexibility that was introduced by the arbitrary positive operator R. Now let {Ea} and {Fa} be families of left and right generalized Choi-Kraus operators for Ψ. Denote the (k,l)-entry of P by pkl and the (i,j)-entry of Ea or Fa by eaij or faij, respectively. Then

Psi Reductionwhere

G_{kl}

Finally, defining x := vec(P) and c := vec(A) (where vec refers to the vectorization of a matrix, which stacks each of its columns on top of each other into a column vector) shows that the quantum primal problem is in the form of the standard primal problem. Some simple linear algebra can be used to show that the quantum dual form reduces to the standard dual form as well.

Downloads:

References:

  1. J. Watrous, Semidefinite programs for completely bounded norms. Preprint (2009). arXiv:0901.4709 [quant-ph]
  2. R. Jain, Z. Ji, S. Upadhyay, J. Watrous, QIP = PSPACE. Preprint (2009). arXiv:0907.4737 [quant-ph]
  3. N. Johnston and D. W. Kribs, A family of norms with applications in quantum information theory. Journal of Mathematical Physics 51, 082202 (2010). arXiv:0909.3907 [quant-ph]

A Brief Introduction to the Multiplicative Domain and its Role in Quantum Error Correction

July 24th, 2009

Given a completely positive linear map E: Mn → Mn, its multiplicative domain, denoted MD(E), is an algebra defined as follows:

MD1

Roughly speaking, MD(E) is the largest subalgebra of Mn on which E behaves multiplicatively. It will be useful to make this notion precise:

Definition. Let A be a subalgebra of Mn and let π : A → Mn. Then π is said to be a *-homomorphism if π(ab) = π(a)π(b) and π(a*) = π(a)* for all a,b ∈ A.

Thus, MD(E) is roughly the largest subalgebra of Mn such that, when E is restricted to it, E is a *-homomorphism (I keep saying “roughly speaking” because of the “∀b ∈ Mn” in the definition of MD(E) — the definition of a *-homomorphism only requires that the multiplicativity hold ∀b ∈ A). Probably the most well-known result about the multiplicative domain is the following theorem of Choi [1,2], which shows how the multiplicative domain simplifies when E is such that E(I) = I (i.e., when E is unital):

Theorem [Choi]. Let E: Mn → Mn be a completely positive map such that E(I) = I. Then

MD2

Let $\phi : \cl{L}(\cl{H}) \rightarrow \cl{L}(\cl{H})$ be a completely positive, unital map. Then
\begin{align*}
MD(\phi) = & \big\{ a \in \cl{L}(\cl{H}) : \phi(a)^{*}\phi(a) =
\phi(a^*a)\text{ and } \phi(a)\phi(a)^{*} =
\phi(aa^*)\big\}.
\end{align*}

One thing in particular that this theorem shows is that, when E(I) = I, the multiplicative domain of E only needs to be multiplicative within MD(E) (i.e., we can remove the “roughly speaking” that I spoke of earlier).

MD(E) in Quantum Error Correction

Before moving onto how MD(E) plays a role in quantum error correction, let’s consider some examples to get a better feeling for what the multiplicative domain looks like.

  • If E is the identity map (that is, it is the map that takes a matrix to itself) then MD(E) = Mn, the entire matrix algebra.
  • If E(a) = Diag(a) (i.e., E simply erases all of the off-diagonal entries of the matrix a), then MD(E) = {Diag(a)}, the set of diagonal matrices.

Notice that in the first example, the map E is very well-behaved (as well-behaved as a map ever could be); it preserves all of the information that is put into it. We also see that MD(E) is as large as possible. In the second example, the map E does not preserve information put into it (indeed, one nice way to think about matrices in the quantum information setting is that the diagonal matrices are “classical” and rest of the matrices are “quantum” — thus the map E(a) = Diag(a) is effectively removing all of the “quantumness” of the input data). We also see that MD(E) is tiny in this case (too small to put any quantum data into).

The above examples then hint that if the map E preserves quantum data, then MD(E) should be large enough to store some quantum information safely. This isn’t quite true, but the intuition is right, and we get the following result, which was published as Theorem 11 in this paper:

Theorem. Let E: Mn → Mn be a quantum channel (i.e., a completely positive map such that Tr(E(a)) = Tr(a) for all a ∈ Mn) such that E(I) = I. Then MD(E) = UCC(E), the algebra of unitarily-correctable codes for E.

What this means is that, when E is unital, its multiplicative domain encodes exactly the matrices that we can correct via a unitary operation. This doesn’t tell us anything about correctable codes that are not unitarily-correctable, though (i.e., matrices that can only be corrected by a more complicated correction operation). To capture these codes, we have to generalize a bit.

Generalized Multiplicative Domains

In order to generalize the multiplicative domain, we can require that the map E be multiplicative with another map π that is already a *-homomorphism, rather than require that it be multiplicative with itself. This is the main theme of this paper, which was submitted for publication this week. We define generalized multiplicative domains as follows:

Definition. Let A be a subalgebra of Mn, let E : Mn → Mn be completely positive, and let π : A → Mn be a *-homomorphism. Then the multiplicative domain of E with respect to π, denoted MDπ(E), is the algebra given by

MD3

It turns out that these generalized multiplicative domains are reasonably well-behaved and generalize the standard multiplicative domain in exactly the way that we wanted: they capture all correctable codes for arbitrary quantum channels (see Theorem 11 of the last paper I mentioned). Furthermore, there are even some characterizations of MDπ(E) analogous to the theorem of Choi above (see Theorems 5 and 7, as well as Corollary 12).

References:

  1. M.-D. Choi, A Schwarz inequality for positive linear maps on C*-algebras. Illinois Journal of Mathematics, 18 (1974), 565-574.
  2. V. I. Paulsen, Completely Bounded Maps and Operator AlgebrasCambridge Studies in Advanced Mathematics 78, Cambridge University Press, Cambridge, 2003.

Unital Channel Eigenvalue Majorization

June 6th, 2009

I’ve decided that, starting today, I will once a month post a mathematical result that I find interesting and/or useful, but I feel sadly gets less attention than it deserves. I will try to present all relevant preliminaries along with the result to provide context, so hopefully the results and proofs will be accessible to someone at the upper undergraduate level.

Since I’m a quantum kinda guy, it seems natural that the first such lemma deals with quantum information science. In particular, it helps quantify the behaviour of unital quantum channels acting on density operators. Before delving into the result, let’s begin with…

Quantum Preliminaries

Given the complex matrix space Mn, a quantum channel E is defined to be a completely positive, trace-preserving map. That is, it is a map of the form

Choi-Kraus representationwhere {Aj} ∈ Mn is a family of matrices. Trace-preservation of E is equivalent to the requirement that

Trace-preservationIn many physical situations we are interested in unital quantum channels; that is, channels that satisfy E(I) = I. Such channels in general exhibit much nicer behaviour than arbitrary quantum channels, and this month’s lemma will show one particular instance of this fact.

The Hardy-Littlewood-Polya Theorem

The proof of the lemma relies on a classical result known as the Hardy-Littlewood-Polya Theorem. The result explains how doubly stochastic matrices act on vectors. Since it seems to be surprisingly difficult to find on Wikipedia and other popular (read: non-research) websites, I will state it here.

Theorem [Hardy-Littlewood-Polya]. Let x,y ∈ Rn be real vectors. Then x majorizes y if and only if y = Dx for some doubly stochastic matrix D ∈ Mn.

It might be worth mentioning that the “if” direction of the proof is borderline trivial; the real meat and potatoes of the theorem is the “only if” direction.

The Lemma Itself

The lemma makes precise something that feels quite natural when thought of physically: a unital channel (that is, a completely positive, trace-preserving map E for which E(I) = I) can only increase the impurity (or “mixedness”) of quantum states. It has several simple consequences that are of great use when dealing with unital channels, and furthermore its proof makes excellent use of classical machinery. It was originally due to Uhlmann [1,2], but has recently appeared in [3]. The proof provided in the PDF attached at the end of this post is from the latter source.

Lemma [Unital Channel Eigenvalue Majorization]. Suppose ρ = E(σ) for a unital channel E. Then the ordered spectrum r of ρ is majorised by the ordered spectrum s of σ.

One particularly useful corollary of this lemma is presented here, and its proof is omitted (and dare I say left as an exercise for the reader?)

Corollary. If E is a unital quantum channel and ρ is a positive operator, then rank(E(ρ)) ≥ rank(ρ).

Related Links

References

  1. A. Uhlmann, Commun. Math. Phys. 54, 21 (1977).
  2. I. Bengtsson, and K. Zyczkowski, Geometry of quantum states, Cambridge University Press (2006).
  3. D. W. Kribs, R. W. Spekkens, Phys. Rev. A 74, 042329 (2006). arXiv:quant-ph/0608045v2